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PREFACE TO THE FIRST EDITION 


This book is a considerable expansion of lectures I gave at the School of 
Mathematical and Physical Sciences, University of Sussex during the winter 
term of 1986. The audience included postgraduate students and faculty members 
working in particle physics, condensed matter physics and general relativity. The 
lectures were quite informal and I have tried to keep this informality as much as 
possible in this book. The proof of a theorem is given only when it is instructive 
and not very technical; otherwise examples will make the theorem plausible. 
Many figures will help the reader to obtain concrete images of the subjects. 

In spite of the extensive use of the concepts of topology, differential ge- 
ometry and other areas of contemporary mathematics in recent developments in 
theoretical physics, it is rather difficult to find a self-contained book that is easily 
accessible to postgraduate students in physics. This book is meant to fill the gap 
between highly advanced books or research papers and the many excellent intro- 
ductory books. As a reader, I imagined a first-year postgraduate student in theo- 
retical physics who has some familiarity with quantum field theory and relativity. 
In this book, the reader will find many examples from physics, in which topo- 
logical and geometrical notions are very important. These examples are eclectic 
collections from particle physics, general relativity and condensed matter physics. 
Readers should feel free to skip examples that are out of their direct concern. 
However, I believe these examples should be the theoretical minima to students 
in theoretical physics. Mathematicians who are interested in the application of 
their discipline to theoretical physics will also find this book interesting. 

The book is largely divided into four parts. Chapters 1 and 2 deal with the 
preliminary concepts in physics and mathematics, respectively. In chapter 1, 
a brief summary of the physics treated in this book is given. The subjects 
covered are path integrals, gauge theories (including monopoles and instantons), 
defects in condensed matter physics, general relativity, Berry's phase in quantum 
mechanics and strings. Most of the subjects are subsequently explained in detail 
from the topological and geometrical viewpoints. Chapter 2 supplements the 
undergraduate mathematics that the average physicist has studied. If readers are 
quite familiar with sets, maps and general topology, they may skip this chapter 
and proceed to the next. 

Chapters 3 to 8 are devoted to the basics of algebraic topology and 
differential geometry. In chapters 3 and 4, the idea of the classification of spaces 
with homology groups and homotopy groups is introduced. In chapter 5, we 


define a manifold, which is one of the central concepts in modern theoretical 
physics. Differential forms defined there play very important roles throughout this 
book. Differential forms allow us to define the dual of the homology group called 
the de Rham cohomology group in chapter 6. Chapter 7 deals with a manifold 
endowed with a metric. With the metric, we may define such geometrical 
concepts as connection, covariant derivative, curvature, torsion and many more. 
In chapter 8, a complex manifold is defined as a special manifold on which there 
exists a natural complex structure. 

Chapters 9 to 12 are devoted to the unification of topology and geometry. 
In chapter 9, we define a fibre bundle and show that this is a natural setting 
for many physical phenomena. The connection defined in chapter 7 is naturally 
generalized to that on fibre bundles in chapter 10. Characteristic classes defined 
in chapter 1 1 enable us to classify fibre bundles using various cohomology 
classes. Characteristic classes are particularly important in the Atiyah-Singer 
index theorem in chapter 12. We do not prove this, one of the most important 
theorems in contemporary mathematics, but simply write down the special forms 
of the theorem so that we may use them in practical applications in physics. 

Chapters 13 and 14 are devoted to the most fascinating applications of 
topology and geometry in contemporary physics. In chapter 13, we apply the 
theory of fibre bundles, characteristic classes and index theorems to the study of 
anomalies in gauge theories. In chapter 14, Polyakov’s bosonic string theory is 
analysed from the geometrical point of view. We give an explicit computation of 
the one-loop amplitude. 

I would like to express deep gratitude to my teachers, friends and students. 
Special thanks are due to Tetsuya Asai, David Bailin, Hiroshi Khono, David 
Lancaster, Shigeki Matsutani, Hiroyuki Nagashima, David Pattarini, Felix E A 
Pirani, Kenichi Tamano, David Waxman and David Wong. The basic concepts 
in chapter 5 owe very much to the lectures by F E A Pirani at King’s College, 
University of London. The evaluation of the string Laplacian in chapter 14 using 
the Eisenstein series and the Kroneckerlimiting formula was suggested by T Asai. 
I would like to thank Euan Squires, David Bailin and Hiroshi Khono for useful 
comments and suggestions. David Bailin suggested that I should write this book. 
He also advised Professor Douglas F Brewer to include this book in his series. I 
would like to thank the Science and Engineering Research Council of the United 
Kingdom, which made my stay at Sussex possible. It is a pity that I have no 
secretary to thank for the beautiful typing. Word processing has been carried out 
by myself on two NEC PC9801 computers. Jim A Revill of Adam Hilger helped 
me in many ways while preparing the manuscript. His indulgence over my failure 
to meet deadlines is also acknowledged. Many musicians have filled my office 
with beautiful music during the preparation of the manuscript: I am grateful to 
J S Bach, Ryuichi Sakamoto, Ravi Shankar and Erik Satie. 

Mikio Nakahara 

Shizuoka, February 1989 


PREFACE TO THE SECOND EDITION 


The first edition of the present book was published in 1990. There has been 
incredible progress in geometry and topology applied to theoretical physics and 
vice versa since then. The boundaries among these disciplines are quite obscure 
these days. 

I found it impossible to take all the progress into these fields in this second 
edition and decided to make the revision minimal. Besides correcting typos, errors 
and miscellaneous small additions, I added the proof of the index theorem in terms 
of supersymmetric quantum mechanics. There are also some rearrangements of 
material in many places. 1 have learned from publications and internet homepages 
that the first edition of the book has been read by students and researchers from a 
wide variety of fields, not only in physics and mathematics but also in philosophy, 
chemistry, geodesy and oceanology among others. This is one of the reasons 
why 1 did not specialize this book to the forefront of recent developments. I 
hope to publish a separate book on the recent fascinating application of quantum 
field theory to low dimensional topology and number theory, possibly with a 
mathematician or two, in the near future. 

The first edition of the book has been used in many classes all over the world. 
Some of the lecturers gave me valuable comments and suggestions. I would like 
to thank, in particular, Jouko Mikkelsson for constructive suggestions. Kazuhiro 
Sakuma, my fellow mathematician, joined me to translate the first edition of the 
book into Japanese. He gave me valuable comments and suggestions from a 
mathematician’s viewpoint. I also want to thank him for frequent discussions 
and for clarifying many of my questions. I had a chance to lecture on the material 
of the book while I was a visiting professor at Helsinki University of Technology 
during fall 2001 through spring 2002. I would like to thank Martti Salomaa for 
warm hospitality at his materials physics laboratory. Sami Virtanen was the course 
assistant whom I would like to thank for his excellent work. I would also like to 
thank Juha Vartiainen, Antti Laiho, Teemu Ojanen, Teemu Keski-Kuha, Markku 
Stenberg, Juha Heiskala, Tuomas Hytonen, Antti Niskanen and Ville Bergholm 
for helping me to find typos and errors in the manuscript and also for giving me 
valuable comments and questions. 

Jim Revill and Tom Spicer of IOP Publishing have always been generous 
in forgiving me for slow revision. I would like to thank them for their generosity 
and patience. I also want to thank Simon Laurenson for arranging the copyediting, 
typesetting and proofreading and Sarah Plenty for arranging the printing, binding 



and scheduling. The first edition of the book was prepared using an old NEC 
computer whose operating system no longer exists. I hesitated to revise the 
book mainly because I was not so courageous as to type a more-than-500-page 
book again. Thanks to the progress of information technology, IOP Publishing 
scanned all the pages of the book and supplied me with the files, from which I 
could extract the text files with the help of optical character recognition (OCR) 
software. I would like to thank the technical staff of IOP Publishing for this 
painstaking work. The OCR is not good enough to produce the LTgX codes for 
equations. Mariko Kamada edited the equations from the first version of the book. 
I would like to thank Yukitoshi Fujimura of Peason Education Japan for frequent 
TpX-nical assistance. He edited the Japanese translation of the first edition of the 
present book and produced an excellent DTgX file, from which I borrowed many 
LTgX definitions, styles, diagrams and so on. Without the Japanese edition, the 
publication of this second edition would have been much more difficult. 

Last but not least, I would thank my family to whom this book is dedicated. 
I had to spend an awful lot of weekends on this revision. I wish to thank my 
wife, Fumiko, and daughters, Lisa and Yuri, for their patience. I hope my 
little daughters will someday pick up this book in a library or a bookshop and 
understand what their dad was doing at weekends and late after midnight. 

Mikio Nakahara 

Nara, December 2002 



HOW TO READ THIS BOOK 


As the author of this book, I strongly wish that this book is read in order. However, 
I admit that the book is thick and the materials contained in it are diverse. Here 
I want to suggest some possibilities when this book is used for a course in 
mathematics or mathematical physics. 

(1) A one year course on mathematical physics: chapters 1 through 10. 
Chapters 1 1 and 12 are optional. 

(2) A one-year course on geometry and topology for mathematics students: 
chapters 2 through 12. Chapter 2 may be omitted if students are familiar with 
elementary topology. Topics from physics may be omitted without causing 
serious problems. 

(3) A single-semester course on geometry and topology: chapters 2 through 
7. Chapter 2 may be omitted if the students are familiar with elementary 
topology. Chapter 8 is optional. 

(4) A single-semester course on differential geometry for general relativity: 
chapters 2, 5 and 7. 

(5) A single-semester course on advanced mathematical physics: sections 1.1- 
1.7 and sections 12.9 and 12.10, assuming that students are familiar with 
Riemannian geometry and fibre bundles. This makes a self-contained course 
on the path integral and its application to index theorem. 

Some repetition of the material or a summary of the subjects introduced in 
the previous part are made to make these choices possible. 


NOTATION AND CONVENTIONS 


The symbols N, Z , Q, M and C denote the sets of natural numbers, integers, 
rational numbers, real numbers and complex numbers, respectively. The set of 
quaternions is defined by 

M = {a + bi + cj + dk\ a, b, c, d G R] 

where (1, i, j , k) is a basis such that i ■ j = —j ■ i — k, j ■ k — —k ■ j — i, 
k -i = —i -k — j ,i 2 — j 2 = k 2 — — 1. Note that i, j and k have the 2x2 matrix 
representations i = ia-3, j = iao- k = irr 1 where cr,- are the Pauli spin matrices 



The imaginary part of a complex number 7, is denoted by Im z while the real part 
is Rez. 

We put c (speed of light) = h (Planck’s constant/27r) = A'b (Boltzmann’s 
constant) = 1, unless otherwise stated explicitly. We employ the Einstein 
summation convention: if the same index appears twice, once as a superscript 
and once as a subscript, then the index is summed over all possible values. For 
example, if // runs from 1 to m, one has 

m 

A 11 B tl = 

11=1 

The Euclid metric is g^ lv — S ^ = diag(+ 1 , . . . , + 1) while the Minkowski metric 
is g^v = Vllv = diag(— 1, +1 +1). 

The symbol □ denotes ‘the end of a proof’. 
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QUANTUM PHYSICS 


A brief introduction to path integral quantization is presented in this chapter. 
Physics students who are familiar with this subject and mathematics students who 
are not interested in physics may skip this chapter and proceed directly to the next 
chapter. Our presentation is sketchy and a more detailed account of this subject 
is found in Bailin and Love (1996), Cheng and Li (1984), Huang (1982), Das 
(1993), Kleinert (1990), Ramond (1989), Ryder (1986) and Swanson (1992). We 
closely follow Alvarez (1995), Bertlmann (1996), Das (1993), Nakahara (1998), 
Rabin (1995), Sakita (1985) and Swanson (1992). 


1.1 Analytical mechanics 

We introduce some elementary principles of Lagrangian and Hamiltonian 
formalisms that are necessary to understand quantum mechanics. 


1.1.1 Newtonian mechanics 


Let us consider the motion of a particle m in three-dimensional space and let x (r) 
denote the position of m at time t . 1 Suppose this particle is moving under an 
external force F{x). Then x(t) satisfies the second-order differential equation 


m 


d 2 x(t) 


= F(x(t)) 


( 1 . 1 ) 


called Newton’s equation or the equation of motion. 

If force F(x) is expressed in terms of a scalar function V (x) as Fix) — 
— VV(x), the force is called a conserved force and the function V (x ) is called 
the potential energy or simply the potential. When F is a conserved force, the 
combination 


E 


m 

~2 



+ V(x) 


( 1 . 2 ) 


is conserved. In fact, 


d£ 

dr 


E 


d xt d 2 Xk 

m T - 

dr dr 2 


dV d xk 
dxk dr 



We call a particle with mass m simply ‘a particle m\ 


<) V \ dxk 
3 x I dr 


i 



where use has been made of the equation of motion. The function E, which is 
often the sum of the kinetic energy and the potential energy, is called the energy. 


Example 1.1. (One-dimensional harmonic oscillator) Let x be the coordinate 
and suppose the force acting on m is F(x) = —kx, k being a constant. This force 
is conservative. In fact, V (x) — \kx 2 yields F(x ) = —dV(x)/dx = —kx. 
In general, any one-dimensional force Fix) which is a function of x only is 
conserved and the potential is given by 



An example of a force that is not conserved is friction F = — qdx/dt. We 
will be concerned only with conserved forces in the following. 

1.1.2 Lagrangian formalism 

Newtonian mechanics has the following difficulties; 

1 . This formalism is based on a vector equation (1.1) which is not very easy to 
handle unless an orthogonal coordinate system is employed. 

2. The equation of motion is a second-order equation and the global properties 
of the system cannot be figured out easily. 

3. The analysis of symmetries is not easy. 

4. Constraints are difficult to take into account. 

Furthermore, quantum mechanics cannot be derived directly from 
Newtonian mechanics. The Lagrangian formalism is now introduced to overcome 
these difficulties. 

Let us consider a system whose state (the position of masses for example) 
is described by N parameters {g,} (1 < i < N). The parameter is an element 
of some space Mr The space M is called the configuration space and the {q,} 
are called the generalized coordinates. If one considers a particle on a circle, for 
example, the generalized coordinate q is an angle 0 and the configuration space 
M is a circle. The generalized velocity is defined by ip — dq, /dr. 

The Lagrangian L(q,q) is a function to be defined in Hamilton’s 
principle later. We will restrict ourselves mostly to one-dimensional space but 
generalization to higher-dimensional space should be obvious. Let us consider 
a trajectory q(t ) ( t e [f/,f/]) of a particle with conditions q(t,) = q , and 
q(tf) = qf. Consider a functional 3 


r tf 

S[q(t),q(t)]= / L(q,q)dt 


(1.3) 


u 


“ A manifold, to be more precise, see chapter 5. 

3 A functional is a function of functions. A function /(•) produces a number fix) for a given number 
x. Similarly, a functional F[*] assigns a number F[f] to a given function f(x). 


called the action. Given a trajectory q(t ) and q(t), the action . q] produces 
a real number. Hamilton’s principle, also known as the principle of the least 
action, claims that the physically realized trajectory corresponds to an extremum 
of the action. Now the Lagrangian must be chosen so that Hamilton’s principle is 
fulfilled. 

It turns out to be convenient to write Hamilton’s principle in a local form 
as a differential equation. Suppose q{t) is a path realizing an extremum of S. 
Consider a variation Sq(t) of the trajectory such that <5 qitj) = 8q(tf ) = 0. The 
action changes under this variation by 


-l 

-L 


L(q + Sq, q + Sq ) dr - 
8q dr 


/"■ 


L(q , q) dr 


•f / dL d dL 

\ 3 q dr dq 


(1.4) 


which must vanish because q yields an extremum of S. Since this is true for any 
8q, the integrand of the last line of (1.4) must vanish. Thus, the Euler-Lagrange 
equation 


3 L d 3 L 
dq dr dq 


(1.5) 


has been obtained. If there are N degrees of freedom, one obtains 


dL d dL 

= 0 (1 < k < N). 

dqk dr dqk 


( 1 . 6 ) 


If we introduce the generalized momentum conjugate to the coordinate q <_• 


by 


Pk = 


dL 

dqk 

the Euler-Lagrange equation takes the form 


d pk dL 
dr dqk 


(1.7) 


( 1 . 8 ) 


By requiring this equation to reduce to Newton’s equation, one quickly finds the 
possible form of the Lagrangian in the ordinary mechanics of a particle. Let us 
put L = \mq 2 — V (q ). By substituting this Lagrangian into the Euler-Lagrange 
equation, it is easily shown that it reduces to Newton’s equation of motion. 


3 V 

mqu + - — = 0. 

dqk 


(1.9) 


Let us consider the one-dimensional harmonic oscillator for example. The 
Lagrangian is 


L(x, i) = 


\kx 2 


( 1 . 10 ) 



from which one finds mx + kx — 0. 

It is convenient for later purposes to introduce the notion of a functional 
derivative. Let us consider the case with a single degree of freedom for simplicity. 
Define the functional derivative of 5 with respect to q by 


= lim . 

Sq(s ) s->o e 


( 1 . 11 ) 


Since 

5 


q(t) + sS(t — s ), q(t) + e — S(t — s) 
at 


/' 


— I dt L\ q(t) + sS(t — s ), q{t) + s — 8 (t — .y) 

dr 


-/ 


= / d t L(q, q) + £ 


/-(S 


dL d 

5(r-i) + — +0(e‘ ) 

oq dr 


/dL d dL 

= S[q, q] + £ — (s) - — — (S') 
dq dr dq 


0(B 2 ), 


the Euler-Lagrange equation may be written as 

8S dL d / dL\ 

8q(s ) ~ dq (X) ~ dr (,9^ J (S) ~ °' 


( 1 . 12 ) 


Let us next consider symmetries in the context of the Lagrangian formalism. 
Suppose the Lagrangian L is independent of a certain coordinate q^/ Such 
a coordinate is called cyclic. The momentum which is conjugate to a cyclic 
coordinate is conserved. In fact, the condition dL/dqk = 0 leads to 

dw d dL dL 

— = ^ = = 0. (1.13) 

dr dr dqk dqk 

This argument can be mathematically elaborated as follows. Suppose the 
Lagrangian L has a symmetry, which is continuously parametrized. This means, 
more precisely, that the action S — f dr L is invariant under the symmetry 
operation on qk(t). Let us consider an infinitesimal symmetry operation qu ( r ) -* 
qk(t) + 8qk(t ) on the path qk(t). 4 5 This implies that if qk(t) is a path producing 
an extremum of the action, then qk(t) — »• qk(t) + Sqk(t) also corresponds to an 
extremum. Since S is invariant under this change, it follows that 


8S = 




d 9L\ 
dr dqk) 


■E 


Sqk 


dL 

9qk 


= 0 . 


4 Of course, L may depend on q^. Otherwise, the coordinate is not our concern at all. 

5 Since the symmetry is continuous, it is always possible to define such an infinitesimal operation. 
Needless to say, 8q(ti ) and 8q(tf ) do not, in general, vanish in the present case. 



The first term in the middle expression vanishes since q is a solution to the Euler- 
Lagrange equation. Accordingly, we obtain 

X 8 1k(ti)pk(ti ) = X Sc lk(tf)pk(tf ) (1.14) 

k k 

where use has been made of the definition pk — dL/dqk . Since r, and tf 
are arbitrary, this equation shows that the quantity 8q k (t)pk(t) is, in fact, 
independent of t and hence conserved. 

Example 1.2. Let us consider a particle m moving under a force produced by a 
spherically symmetric potential V(r), where r, 0, (f> are three-dimensional polar 
coordinates. The Lagrangian is given by 

L — n[r 2 + r 2 {9 2 + sin 2 00 2 )] — V(r). 

Note that q k = <pis cyclic, which leads to the conservation law 

3 L 2 2 

8(f > — - oc mr shr 0<p — constant. 

d(p 

This is nothing but the angular momentum around the z. axis. Similar arguments 
can be employed to show that the angular momenta around the x and y axes are 
also conserved. 


A few remarks are in order: 


• Let Q(q) be an arbitrary function of q. Then the Lagrangians L and 
L + dQ/dt yield the same Euler-Lagrange equation. In fact. 


3 

3 qk 


T + de '' 
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3 L 
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dqk 

dq k dr 


dqk 

d t 

3 dQ 

d 3 Q _ 

0 . 



dq k dr dr 3 q k 



• An interesting observation is that Newtonian mechanics is realized as an 
extremum of the action but the action itself is defined for any trajectory. This 
fact plays an important role in path integral formation of quantum theory. 


1.1.3 Hamiltonian formalism 

The Lagrangian formalism yields a second-order ordinary differencial equation 
(ODE). In contrast, the Hamiltonian formalism gives equations of motion which 
are first order in the time derivative and, hence, we may introduce flows in the 



phase space defined later. What is more important, however, is that we can make 
the symplectic structure manifest in the Hamiltonian formalism, which will be 
shown in example 5.12 later. 

Suppose a Lagrangian L is given. Then the corresponding Hamiltonian is 
introduced via Legendre transformation of variables as 

#(<?, P) = y ^Pkqk ~ L{q, q ), (1.15) 

k 


where q is eliminated in the left-hand side (LHS) in favour of p by making use of 
the definition of the momentum p k = <)L(q , q)/dq k . For this transformation to 
be defined, the Jacobian must satisfy 


det 


/ d£i_ 
\dqj 



#0. 


The space with coordinates ( qk , pk) is called the phase space. 

Let us consider an infinitesimal change in the Hamiltonian induced by Sqk 
and 8pk, 


E 

k L 

= E 


dL dL 

Spkqk + PkSqk ~ - — 8q k - — - 8q k 
oqk oq k 


k L 

It follows from this relation that 


dL 

8pkqk - - — 8q k 
oqk 



8H _ dL 

dqk dqk 


(1.16) 


which are nothing more than the replacements of independent variables. 
Hamilton’s equations of motion are obtained from these equations if the Euler- 
Lagrange equation is employed to replace the LHS of the second equation. 


qk — 


dH 

dpk 


Pk = 


dH 

dqk 


(1.17) 


Example 1.3. Let us consider a one-dimensional harmonic oscillator with the 
Lagrangian L — \mq 2 — \inco 2 q 2 , where to 2 = k/m. The momentum conjugate 
to q is p = dL/dq — mq , which can be solved for q to yield q = p/m. The 
Hamiltonian is 

H (q. p) = pq~ L(q , q) = \ mu> 2 q 2 . (1.18) 

2m 2 

Hamilton’s equations of motion are: 


dp 

d t 


—marq 


dq_ = p_ 

d t m 


(1.19) 



Let us take two functions A(q, p ) and B(q, p ) defined on the phase space of 
a Hamiltonian H . Then the Poisson bracket [A, B] is defined by 6 


[A,B] = J2 


9A dB 9A 9£\ 
dq k dp k dpkdqkj 




( 1 . 20 ) 


Exercise 1.1. Show that the Poisson bracket is a Lie bracket, namely it satisfies 

[A, c\B\ + C 2 B 2 ] = ci[A, Z?i] + C 2 [A, Bt\ 

[A,B] = -[B,A] 

[[A, B], C ] + [[C, A], B] + [[£, C], A] = 0 

The fundamental Poisson brackets are 

[Pi, Pj] = [qi, qj] = 0 Ul 

It is important to notice that the time development of a physical quantity 
A(q , p) is expressed in terms of the Poisson bracket as 


linearity 

(1.21a) 

skew- symmetry 

(1.21b) 

Jacobi identity. 

(1.21c) 

Pj] = Sij. 

0.22) 


dA _ 
d 7 ~ ^ 

k 

= E 


dA d q k dA dp k 

d q k dr dp k dr 

dA dH d A dH\ 

d qkdpk dp k dq k J 


— [A, H], 


(1.23) 


If it happens that [A, H] = 0, the quantity A is conserved, namely dA/dr = 0. 
The Hamilton equations of motion themselves are written as 


d P k r ui 
— — = [pk, H] 
at 


r z_n 
— = [qk, H], 
dr 


(1.24) 


Theorem 1.1. (Noether’s theorem) Let H(qk, p k ) be a Hamiltonian which is 
invariant under an infinitesimal coordinate transformation q k -> q' k — q k + 
efk(q). Then 

Q = Pkfkiq) (1.25) 

k 


is conserved. 


Proof. One has H{q k . pk) — H (qf pi) by definition. It follows from q' k = 
qk + efk(q) that the Jacobian associated with the coordinate change is 


A ij - 


_ Sjj + E 

dqj 


dfi(q) 

dqj 


6 When the commutation relation [A, B] of operators is introduced later, the Poisson bracket will be 
denoted as [A, 5]pb to avoid confusion. 



up to O(s). The momentum transforms under this coordinate change as 


Pi Pi A ji 

j 


— Pi ~ £ 


Epj 


dJi 

dqi ' 


Then, it follows that 


0 = H(q' k , p' k ) - H(q k , p k ) 
8H dH dfi 

= ef(q) - ^- s Pi-r~ 

o q k op j dqj 


— e 


dH 

3 qk 


fk(q ) 


3 H_ dfi_ 

dpj Pl 3 qj 


= s[H , Q] = s 


d Q 
dr 


which shows that Q is conserved. 


□ 


This theorem shows that to find a conserved quantity is equivalent to finding 
a transformation which leaves the Hamiltonian invariant. 

A conserved quantity Q is the ‘generator’ of the transformation under 
discussion. In fact, 


tei, Q] = £ 

k 


3^30 
3 qk dPk 


dqi d Q 
dpk 3 q k 


^2s ik fk(q) = fi(q) 
k 


which shows that Sq ; = sfi(q) — s[q , , Q\. 

A few examples are in order. Let H = p 2 / 2 rn be the Hamiltonian of a free 
particle. Since H does not depend on q, it is invariant under q i-^ q+s- 1, p\-+ p. 
Therefore, Q = p ■ 1 = p is conserved. The conserved quantity Q is identified 
with the linear momentum. 


Example 1.4. Let us consider a paticle m moving in a two-dimensional plane with 
the axial symmetric potential V (r). The Lagrangian is 

L(r, 9) — + r 2 <p 2 ) — V(r). 

The canonical conjugate momenta are: 

p r — mr po = mr 2 0. 


The Hamiltonian is 

2 2 

H = p r r + p e 9 - L = + + y (/■)■ 

2m Zmr- 

This Hamiltonian is clearly independent of 9 and, hence, invariant under the 
transformation 

9 i-^ 9 + e • 1, pe po. 



The corresponding conserved quantity is 


2 * 

Q = Pe • 1 = mr 9 


that is the angular momentum. 

1.2 Canonical quantization 

It was known by the end of the 1 9th century that classical physics, 
namely Newtonian mechanics and classical electromagnetism, contains serious 
inconsistencies. Later at the beginning of the 20th century, these were resolved by 
the discoveries of special and general relativities and quantum mechanics. So far, 
there is no single experiment which contradicts quantum theory. It is surprising, 
however, that there is no proof for quantum theory. What one can say is that 
quantum theory is not in contradiction to Nature. Accordingly, we do not prove 
quantum mechanics here but will be satisfied with outlining some ‘rules’ on which 
quantum theory is based. 

1.2.1 Hilbert space, bras and kets 

Let us consider a complex Hilbert space 7 

n = m,\f (1.26) 

An element of H is called a ket or a ket vector. 

A linear function a : H -* C is defined by 

a(ci\i/[) + C 2 W 2 )) = cia(|Vq» + c 2 a(\^ 2 )) Vc/ e C, |Vh> e U. 

We employ a special notation introduced by Dirac and write the linear function 
as (a | and the action as {(x\i//) e C. The set of linear functions is itself a vector 
space called the dual vector space of H , denoted 11* . An element of 11. is called 

a bra or a bra vector. 

Let {|ei}, 1 ^ 2 ), . . .} be a basis of *H. S Any vector |i fr) e H is then expanded 
as |i/f) = '}2 k ipk\ek), where ifk e C is called the A;th component of | ijr). Now let 
us introduce a basis {(ei|, (£ 2 !, . . .} in TL* . We require that this basis be a dual 
basis of {\ek)}, that is 

(ei\ej) = Sij. (1.27) 

7 In quantum mechanics, a Hilbert space often means the space of square integrable functions L~(M ) 
on a space (manifold) M. In the following, however, we need to deal with such functions as <5 (a;) and 
q^ x with infinite norm. An extended Hilbert space which contains such functions is called the rigged 
Hilbert space. The treatment of Hilbert spaces here is not mathematically rigorous but it will not cause 
any inconvenience. 

8 We assume Ti is separable and there are, at most, a countably infinite number of vectors in the basis. 
Note that we cannot impose an orthonormal condition since we have not defined the norm of a vector. 



Then an arbitrary linear function {a\ is expanded as (a| = ZZk a k( s k\> where 
a/c e C is the kth component of (a|. The action of (a| e 77* on \ij/) e W is now 
expressed in terms of their components as 


{a\f) = = TaitiSu = (1-28) 

ij ij i 

One may consider |i Jr) as a column vector and (a| as a row vector so that (< *| x/r) 
is regarded as just a matrix multiplication of a row vector and a column vector, 
yielding a scalar. 

It is possible to introduce a one-to-one correspondence between elements in 
Tt and 77*. Let us fix a basis {\ek)} of 77 and {(e^|) of 77*. Then corresponding to 
\ifr) = i'klek), there exists an element {\jf \ — ZZk V r t ( e ifc I e ?7*. The reason 
for the complex conjugation of \jf k becomes clear shortly. Then it is possible to 
introduce an inner product between two elements of 77. Let \<p), |i/r) e 7/ . Their 
inner product is defined by 

( 1 = = ^ L29 ) 

k 

We customarily use the same letter to denote corresponding bras and kets. The 
norm of a vector |i/r) is naturally defined by the inner product. Let |||i/''}|| = 
y/(4f\ 4r) . It is easy to show that this definition satisfies all the axioms of the norm. 
Note that the norm is real and non-negative thanks to the complex conjugation in 
the components of the bra vector. 

By using the inner product between two ket vectors, it becomes possible 
to construct an orthonormal basis {| ek)} such that (|e;), | ej)) — ( et\ej ) = Sij. 
Suppose |i j/) = ^k\ek) ■ By multiplying {ek \ from the left, one obtains 

{ekW) = fk- Then \f) is expressed as \ f) — J2k( e kW\ek) = J2k \ e k){ek\f)- 
Since this is true for any |i jr), we have obtained the completeness relation 

Y J \ e k){ek\ = I , (1-30) 

k 

I being the identity operator in T-L (the unit matrix when It is finite dimensional). 

1.2.2 Axioms of canonical quantization 

Given an isolated classical dynamical system such as a harmonic oscillator, we 
can construct a corresponding quantum system following a set of axioms. 

Al. There exists a Hilbert space H for a quantum system and the state of the 
system is required to be described by a vector |i [r) e ’H. In this sense, 
|i fr) is also called the state or a state vector. Moreover, two states i// } and 
c\ir) (c e C, c ^0) describe the same state. The state can also be described 

as a ray representation of T-L. 



A2. A physical quantity A in classical mechanics is replaced by a Hermitian 
operator A acting on 77. y The operator A is often called an observable. 
The result obtained when A is measured is one of the eigenvalues of A. (The 
Hermiticity of A has been assumed to guarantee real eigenvalues.) 

A3. The Poisson bracket in classical mechanics is replaced by the commutator 

[A,B] = AB-BA (1.31) 

multiplied by —i/h. The unit in which h = 1 will be employed hereafter 
unless otherwise stated explicitly. The fundamental commutation relations 
are (cf (1.22)) 


[qi,qj] = [ Pi , Pj ] = 0 [qi, pj] = i Sij. (1.32) 


Under this replacement, Hamilton’s equations of motion become 


d qi 

d t 


-[qi, H] 

1 


dpi 

dr 


1 


~[Pi, H]. 
1 


(1.33) 


When a classical quantity A is independent of t explicitly, A satisifies the 
same equation as Hamilton’s equation. By analogy, for A which does not 
depend on t explicitly, one has Heisenberg’s equation of motion: 


dA 

dr 


t[A, H]. 


(1.34) 


A4. Let 1 1 j/) e It he an arbitrary state. Suppose one prepares many systems, each 
of which is in this state. Then, observation of A in these systems at time t 
yields random results in general. Then the expectation value of the results is 
given by 


= WA(r) hA) 

<M) 


(1.35) 


A5. For any physical state \ \[f) e 77, there exists an operator for which | xjr) is one 
of the eigenstates. 10 


These five axioms are adopted as the rules of the game. A few comments 
are in order. Let us examine axiom A4 more carefully. Let us assume that |t/r) is 
normalized as |||i/r}|| 2 = (iA'I'A) = 1 for simplicity. Suppose A(r) has the set of 
discrete eigenvalues {a,,} with the corresponding normalized eigenvectors { | « ) } : 11 


A(t)\n) — a n \n) {n\n) — 1. 

9 An operator on T-L is denoted by 7 This symbol will be dropped later unless this may cause 
confusion. 

10 This axiom is often ignored in the literature. The raison d'etre of this axiom will be clarified later. 

11 Since A(t ) is Hermitian, it is always possible to choose (|/i)} to be orthonormal. 



Then the expectation value of A{t) with respect to an arbitrary state 
W = '^2'J'n\n) fn = {n \f) 


{f\A(t)\f) = ^2f*f n {m\A(t)\n) = 'Yha n \i/ n \ 2 . 

m,n n 

From the fact that the result of the measurement of A in state \ri) is always a„, it 
follows that the probability of the outcome of the measurement being a„, that is 
the probability of | \jr) being in \ri), is 

\fn\ 2 = mf)\ 2 . 

The number {n\\jf) represents the ‘weight’ of the state \ri) in the state | \[r) and is 
called the probability amplitude. 

If A has a continuous spectrum a, the state i//} is expanded as 


w = / da \j/(a)\a). 
The completeness relation now takes the form 

da = I . 


S' 


(1.36) 


Then, from the identity f do' \a'){a'\a) — |fl), one must have the normalization 

{a'\a) = S(a' -a), (1.37) 

where 8(a) is the Dirac 8 -function. The expansion coefficient is obtained 
from this normalization condition as i/Ka) = If W) is normalized as 

(\//\\ fr) — 1, one should have 


-I 


1 = / da d a' 


'){a\a') = J 


da |i 


It also follows from the relation 


(i/f|A|i/f) = J a\x[r(a)\ 2 da 

that the probability with which the measured value of A is found in the interval 
[a, a + do] is \x[r(a)\ 2 da. Therefore, the probability density is given by 


p(a) = \(a\is)\‘- 


(1.38) 


Finally let us clarify why axiom A5 is required. Suppose that the system 
is in the state | \[r) and assume that the probability of the state to be in \<p) 
simultaneously is \{i'\(p)\ 2 ■ This has already been mentioned, when |i/r) is an 
eigenstate of some observable. Axiom A5 asserts that this is true for an arbitrary 
state |i jr). 



1.2.3 Heisenberg equation, Heisenberg picture and Schrodinger picture 

The formal solution to the Heisenberg equation of motion 


di 

dr 


-U,H] 


is easily obtained as 


A(t) = q' h, A{ 0)e _iffr . 


Therefore, the operators A(r ) and A (0) are related by the unitary operator 


U(t) = e 


-\Ht 


(1.39) 

(1.40) 


and, hence, are unitary equivalent. This formalism, in which operators depend on 
f, while states do not, is called the Heisenberg picture. 

It is possible to introduce another picture which is equivalent to the 
Heisenberg picture. Let us write down the expectation value of A with respect 
to the state |t/f) as 

<A(0) = We iAt Am~ i6t w 

= ((tA|e i ^)A(0)(e- i ^|tA». 

If we write \ijr(t)) = e~' H, \\[r), we find that the expectation value at t is also 
expressed as 

<A(f)> = <iKO|A(0)IiKO>. (1.41) 

Thus, states depend on t while operators do not in this formalism. This formalism 
is called the Schrodinger picture. 

Our next task is to find the equation of motion for 1 1 jr (t)). To avoid confusion, 
quantities associated with the Schrodinger picture (the Heisenberg picture) are 
denoted with the subscript S (H), respectively. Thus, | (^))s = ^~ lHr \i')n 
and As = Ah( 0). By differentiating |i/r(f)}s with respect to t, one finds the 

Schrodinger equation: 

ij-|V f (0)s = H\x/f(t)) s . (1.42) 

at 

Note that the Hamiltonian H is the same for both the Schrodinger picture and the 
Heisenberg picture. We will drop the subscripts S and H whenever this does not 
cause confusion. 


1.2.4 Wavefunction 

Let us consider a particle moving on the real line M and let x be the position 
operator with the eigenvalue y and the corresponding eigenvector y); i |y) = 
y|y). The eigenvectors are normalized as { x\y ) — 8 (x — y). 



Similarly, let q be the eigenvalue of p with the eigenvector |g); p\q) = q\q) 
such that ( p\q ) — S(p — q). 

Let |i jr) e LL be a state. The inner product 

fix) = (x\f) (1.43) 

is the component of \x//) in the basis |x}, 

\f) = J |x)(.t| dx | f) — J f(x)\x) dx. 

The coefficient f(x) e C is called the wavefunction. According to the 
earlier axioms of quantum mechanics outlined, it is the probability amplitude of 
finding the particle at x in the state \x//), namely |i/ r (x)| 2 dx is the probability of 
finding the particle in the interval [x, x + dx]. Then it is natural to impose the 
normalization condition 


/dx|^)| 2 = ( W) = l (1.44) 

since the probability of finding the particle anywhere on the real line is always 
unity. 

Similarly, i fr(p) — (p\4r) is the probability amplitude of finding the particle 
in the state with the momentum p and the probability of finding the momentum 
of the particle in the interval [p, p + d/j] is \ir(p)\ 2 dp. 

The inner product of two states in terms of the wavefunctions is 

(i/f|0) = J dx (%lr\x){x\(p) — J d x \ls*(x)(j>(x), (1.45a) 

= J dp Wp)(p\<p) = J dpfjf*(p)<j>(p). (1.45b) 

An abstract ket vector is now expressed in terms of a more concrete 
wavefunction ir(x) or i jr{p). What about the operators? Now we write down the 
operators in the basis \x). From the defining equation x\x) = x\x), one obtains 
{x\x — (x\x, which yields after multiplication by |i //) from the right, 

{x\x\^jf) — x{x\\jr) — x\ [r(x). (1.46) 

This is often written as (xxjr){x) = xx[r(x). 

What about the momentum operator pi Let us consider the unitary operator 

U(a) = 


Lemma 1.1. The operator U (a) defined as before satisfies 


U (a) \x) = \x + a). 


(1.47) 



Proof. It follows from [jc, p] — i that [x,/3' ! ] = 'mp n 1 for n 
Accordingly, we have 


[x, U (a)] = 



n 


= aU (a) 


1 , 2 ,.... 


which can also be written as 

xU (a ) | jc ) = U (a)(x + a)\x) — (x + a)U (a)|x). 

This shows that U (a) \x) oc \x + a). Since U (a) is unitary, it preseves the norm 
of a vector. Thus, U (a) \x) = \x + a). □ 

Let us take an infinitesimal number s. Then 

U(s)\x) = \x + s) ~ (1 - iep)\x). 

It follows from this that 


p\x) 


\x + s) - |x) e- 


— IS 


■o d 

■> 1-H*> 

ax 


(1.48) 


and its dual 


, , , {x + s| - <x| s^o . d 

(x\p = : ► — i— (-^1 - 

is dx 


Therefore, for any state |i jr), one obtains 


d d 

{x\p\f) = - 1 — (x| f) = -l—f{x). 
ax ax 


(1.49) 


(1.50) 


This is also written as (p f)(x) — — i d\jf(x)/dx. 

Similarly, if one uses a basis | p), one will have the momentum representation 
of the operators as 


_*> 

II 

i 

5^ 

(1.51) 

5> 

II 

(1.52) 

d 

(P \x\f) = 1 —f(p) 

dp 

(1.53) 

(p\p\t) = pif(p)- 

(1.54) 


Exercise 1.2. Prove (1.5 1) — (1 .54). 
Proposition 1.1. 


{x\p) = 


e i px 

V2n 


(P\x) 


e -i P* 

V2n 


(1.55) 


(1.56) 



Proof. Take |i//) = | p) in the relation 


d 

(pf)(x) = {x\p\ir) = -\—f(x) 
ax 

to find 

d 

p{x\p) = (x\p\p) = ~i—(x\p). 

ax 

The solution is easily found to be 

(X\p) = Ce ipx . 

The normalization condition requires that 

$(x - y) = My) = {x\ J \p){p\ dp |y} 

= C 2 J d pjpb-y* 

— C 2 2jt8(x — y), 

where C has been taken to be real. This shows that C = 1 1 \Fhx. The proof of 
(1.56) is left as an exercise. □ 


Thus, if(x) and fip) are related as 

/ dx ■ „ 

p ty(x) (1.57) 

which is nothing other than the Fourier transform of t//(x). 

Let us next derive the Schrodinger equation which i// (x ) satisfies. By 
applying (x| on ( 1 .42) from the left, we obtain 

(jcii^i^w) = (x\Hm)) 

dr 


tip) = (P \f) = / dx {p\x){x\f) 


/' 


where the subscript S has been dropped. For a Hamiltonian of the type H = 

p 2 /2m + V (x), we obtain the time-dependent Schrodinger equation: 


i— f(x, t) — 
dr 


P~ 

2m 

1 d 2 

2m dx 2 


V(x) 


if(t ) , 


i//(x, t) + V (x)\js(x, r), 


(1.58) 


where x/r (x, t) = {x\xj/(t)). 

Suppose a solution of this equation is written in the form fix, t) — 
T(t)(p(x). By substituting this into (1.58) and dividing the result by fix, r), 
we obtain 

i T'(t) — (j)"(x)/2m + V (x)<t>(x) 


T(t ) 


<P(x) 



where the prime denotes the derivative with respect to a relevant variable. Since 
the LHS is a function of t only while the right-hand side (RHS) of x only, they 
must be a constant, which we label E. Accordingly, there are two equations, 
which should be solved simultaneously, 


i T'(t) = ETit ) 

(1.59) 

1 d 2 


- — + V(x)<P(x) = E<p(x). 

Zm ax~ 

(1.60) 

The first equation is easily solved to yield 


T(t) — exp(— \Et) 

(1.61) 


while the second one is the eigenvalue problem of the Hamiltonian operator 
and called the time-independent Schrodinger equation, the stationary state 
Schrodinger equation or, simply, the Schrodinger equation. For three- 
dimensional space, it is written as 

1 9 

V‘0(jc) + V(x)<p(x) = E(p{x). (1.62) 

2m 

1.2.5 Harmonic oscillator 

It is instructive to stop here for the moment and work out some non-trivial 
example. We take a one-dimensional harmonic oscillator as an example since 
it is not trivial, it is still solvable exactly and it is very important in the folllowing 
applications. 

The Hamiltonian operator is 

H = ^ — b - mar.x 2 [. x , /3] = i. (1.63) 

2m 2 

The (time-independent) Schrodinger equation is 
Id 2 1 9 9 

— ^ir(x) + -mw /L x z iJ/(x) = Ei/f(x). (1.64) 

2m dx z 2 

By rescaling the variables as £ = ~Jma>x, e — E/hco, one arrives at 

+ {e - = 0. (1.65) 

The normalizable solution of this ordinary differential equation (ODE) exists only 
when £ = £„ = (« + ^) (n = 0, 1,2,...) namely 

E = E n = (n + j)a> (n — 0, 1, 2, . . .) (1.66) 


and the normalized solution is written in terms of the Hermite polynomial 



as 


( 1 . 68 ) 


iK$) = 


2 n n\y/jt 


H„Une 


-r/2 


This eigenvalue problem can also be analysed by an algebraic method. 
Define the annihilation operator a and the creation operator a ' by 


„ Imw A / 1 

0 = V 2 X+ ‘t2n,o/ 

(1.69) 

Imco A / 1 

a' = J x — lJ p. 

V 2 V 2 mw 

(1.70) 

The number operator N is defined by 


N — o' a. 

(1.71) 

Exercise 1.3. Show that 


[a, a] = [a^, a = 0 [a, ii 1 ^] = 1 

(1.72) 

and 


[)V,fl] = — a [)V,fl t ]=a^. 

(1.73) 

Show also that 


H = (N + \)co. 

(1.74) 


Let | n) be a normalized eigenvector of N, 

N\n) — n\n). 

Then it follows from the commutation relations proved in exercise 1.3 that 

N(a\n)) = (aN — a)\n) = (n — 1)(a|«}) 

N(a ' |n}) = (a 1 N + a ^)\n) — (n + 1 )(a^|n)). 

Therefore, a decreases the eigenvalue by one while a 1 increases it by one, hence 
the name annihilation and creation. Note that the eigenvalue n > 0 since 

n — (n|!V|n) = ((n\a^)(a\n)) — ||a|n)|| 2 > 0. 

The equality holds if and only if a\n) — 0. Take a fixed no > 0 and apply a 
many times on \no). Eventually the eigenvalue of a k \no) will be negative for 
some integer k > no, which is a contradiction. This can be avoided only when no 
is a non-negative integer. Thus, there exists a state |0) which satisfies a|0) = 0. 
The state |0) is called the ground state. Since N |0) = a r a\0) = 0, this state is 



the eigenvector of N with the eigenvalue 0. The wavefunction i/btCO = (jc | 0) is 
obtained by solving the first-order ODE 

(.r|fl|0> = (dv '/ f °( X ' ) + fncoxfoix)^ — 0. (1-75) 

The solution is easily found to be 

i/tq(x) — C exp( — mcox 2 /2) (1-76) 

where C is the normalization constant given in (1.68). An arbitrary vector | n) is 
obtained from 1 0) by a repeated application of a'. 

Exercise 1.4. Show that 

I n) = -^(aVlO) (1.77) 

V n\ 

satisfies N\n) = n\ri) and is normalized. 

Thus, the spectrum of N turns out to be Spec N = {0, 1, 2, . . .} and hence 
the spectrum of the Hamiltonian is 

Spec £ = {£,§, §,...}. (1.78) 

1.3 Path integral quantization of a Bose particle 

The canonical quantization of a classical system has been discussed in the 
previous section. There the main role was played by the Hamiltonian and the 
Lagrangian did not show up at all. In the present section, it will be shown that 
there exists a quantization process, called the path integral quantization, based 
heavily on the Lagrangian. 

1.3.1 Path integral quantization 

We start our analysis with one-dimensional systems. Let x(t) be the position 
operator in the Heisenberg picture. Suppose the particle is found at Xj at time 
tj (>0). Then the probability amplitude of finding this particle at xf at later time 

tf ( >tt ) is 

(x f , t f \ Xi , tj) (1.79) 

where the vectors are defined in the Heisenberg picture, 12 

x(ti)\xi, tj) = xi\xj, ti) (1.80) 

x(tf)\xf, tf) = Xf\xf, tf). (1.81) 

*“We have dropped S and H again to simplify the notation. Note that |x/ . f;> is an instantaneous 
eigenvector and hence parametrized by the time t( when the position is measured. This should not be 
confused with the dynamical time dependence of a wavefunction in the Schrodinger picture. 



The probability amplitude (1.79) is also called the transition amplitude. 

Let us rewrite the probability amplitude in terms of the Schrodinger picture. 
Let x = x(0) be the position operator with the eigenvector 

x\x ) — x\x). (1.82) 

Since x has no time dependence, its eigenvector should be also time independent. 


If 


x(t,) = e iHt ‘xe~ iHti 
is substituted into (1.80), we obtain 

e'^'xe -1 ^ \xj, tj ) = Xi\xj, U). 

By multiplying e~ lH,i from the left, we find 

x[e~ lfl,i | Xi, t ,)] = Xi[eT lflti | Xi, U)]. 


(1.83) 


This shows that the two eigenvectors are related as 



1 Xj,ti) = e lHti \xi). 

(1.84) 

Similarly, we have 




1 x f ,tf) = e lH ‘f\x f ), 

(1.85) 

from which we obtain 




{Xf, t/\ = ( x f \e ~' H,f . 

(1.86) 

From these results, we 

express the probability amplitude in 

the Schrodinger 

picture as 



{x/,tf\xi,ti) = {xf\e ' H(,f u) \xi). 

(1.87) 

In general, the function 

h(x,y; 0) = <x|e ^ p \y) 

(1.88) 


is called the heat kernel of H . This nomenclature originates from the similarity 
between the Schrodinger equation and the heat equation. The amplitude (1.87) is 
the heat kernel of H with imaginary ft : 

{xf, tf\xi, ti ) = h(xf,Xi\ i (tf - tj)). (1.89) 

Now the amplitude (1.87) is expressed in the path integral formalism. To 
this end, we consider the case in which tf — t, — s is an infinitesimal positive 
number. Let us put x,- = x and Xf = y to simplify the notation and suppose the 
Hamiltonian is of the form 




Figure 1.1. The integration contour. 


We first prove the following lemma. 

Lemma 1.2. Let a be a positive constant. Then 



(1.91) 


Proof. The integral is different from an ordinary Gaussian integral in that the 
coefficient of p 2 is a pure imaginary number. First replace p by z = x + iy. The 
integrand exp(— iaz 2 ) is analytic in the whole z -plane. Now change the integration 
contour from the real axis to the one shown in figure 1.1. Along path 1 , we have 
dz = dx and hence this path gives the same contribution as the original integration 
(1.91). The contribution from paths 2 and 4 vanishes as R — > oo. Noting that the 
variable along path 3 is z = (1 — i)x, we evaluate the contribution from this path 
as 


(1 — i) 




The summation of all the contribution must vanish due to Cauchy’s theorem and, 
hence, 



Now this lemma is employed to obtain the heat kernel for an infinitesimal 
time interval. 


Proposition 1.2. Let H be a Hamiltonian of the form (1.90) and e be an 
infinitesimal positive number. Then for any x , y £ TR, we find that 


(x|e-" £ | y) = 


1 


\f2ni 


: exp 


IS 


te 


(x - y) 


2 \ 2 


V 


+ 0(e 2 ) + 0(e(x 


y) 2 ) 


(1.92) 


Proof. The completeness relation for the momentum eigenvectors is inserted into 
the LHS of (1.92) to yield 


where 


(.r|e-^|y> = J dk{x\c~ iski \k)(k\y) 
= f ^ *y ieH ik. 

J 2 : r 


1 d 2 

H x = j 2 + y W- 
2m ax z 


Now we find from the commutation relation of 3 V = d/dx and e lkx that 


d x e ikx = i ke ikx + e ikx d x = e ikx (i k + 3 V ). 


Repeated application of this commutation relation yields 

d n e itx _ e i kx {ik + (f| = (). 1.2....) 

from which we obtain 


e — ie[— d x /2m+V(x)\^kx _ ^.kx & —\e[—(\k+d x )~ /2m+ V(x)] 


Therefore, 


(x|e _i ^ £ |y) = f ^ gi^(x— y)g— i£[— (ik+dx) 2 /2m+V (.r)l 

J 2 it 

_ f ^ -i\ek 2 /2m-k(x— v)1 — ie[ — ikd r /m— dl/lm+VixM j 

J 2tc 

where the ‘ 1 ’ at the end of the last line is written explicitly to remind us of the 
fact d x 1 = 0. If we further put p = *Je/2mk and expands the last exponential 
function in the last line, we obtain 


(x|e- i£ "|y> = y^ e im( *- y)2/2£ 


[ — e -i ' 

J 2ir 


E 

n = 0 


(-i e) n 


2 

Ul p d x 

em 


[p+*/m/2£(x-y)] 

3 2 


2m 


+ V(x) 


• 1. 



If we put q — p + «Jm /2s (x — y ) and use lemma 1.2, we obtain: 


(x|e- i£ "|y> = J ^i £ 


-n 


(— e 2 ) (— i) 

1 + (-ie)y(x) + . \ x - y)d x V(x) 


+ 0(e 2 ) + 0(s\x - y\ 2 ) 


= ,ie(m/2)[(x—y)/e] 2 


27rie 


x exp 


— isV 


x + y 


+ (D(s 2 ) + 0(e\x - y\ 2 ) 


Thus, the proposition has been proved. 


□ 


Note that the average value (x + y)/2 appeared as the variable of V in (1 .92). 
This prescription is often called the Weyl ordering. 

It is found from ( 1 .92) that the integrand oscillates very rapidly for \x—y\ > 
y/s and it can be regarded as zero in the sense of distribution (the Riemann- 
Lebesgue theorem). Therefore, as x — y < e, the exponent of (1.92) approaches 
the action for an infinitesimal time interval [0, e], 

AS = £ dr ir - V (x)] ~ [|u 2 - V(x)] £ (1.93) 

where v — (x — y)/s is the average velocity and x is the average position. 
Equation (1.92) also satisfies the boundary condition for £ -> 0, 


(xle-^ly)^ 0 (x| y) = S(x-y). 


(1.94) 


This can be shown by noting that 


m 


/ oo 

-oo dx y 


,im(x—y) 2 /2s j 


The transition amplitude (1.79) for a finite time interval is obtained by 
infinitely repeating the transition amplitude for an infinitesimal time interval one 
after another. Let us first divide the interval tf — tj into n equal intervals, 

„ _ f .f ~ 


Put to — tj and tk — to + ek (0 < k < n). Clearly t n — tf. Insert the completeness 
relation 

1 = [ dxk\xk, tk){xk, tk\ (l<k<n-l) 



for each instant of time tk into (1.79) to yield 


(xf,tf\xi,ti) = (x/,tf \ J dx n -i\x n -i,t„-i)(x n -i,t„-i\ 

x J dx n - 2 \x n - 2 , t n - 2 ) ■ . ■ j dx\\x\, t\)(x\, t\\xo, to). 

Let us consider here the limit s —> 0, namely n — > oo. Proposition 1.2 states that 
for an infinitesimal s, we have 


{xk . tk \Xk - 1 • tk— i ) 



where 

A Sk = s 

Therefore, we find 


m I Xk - Xk-i 
2 


- V 


Xk-l +Xk 


(Xf, tf\xj, ti) — lim 


/0 „ n — 1 / n 

(^) / n^«p(>E- 

7=1 k = 1 


. (1.95) 


If n — 1 points xi,x 2 , , x„-i are fixed, we obtain a piecewise linear path from 
xo to x„ via these points. Then we define 5({x^}) = ASk, which in the limit 
n — > oo can be written as 

S({x k }) “ S[x(t)] = J f dr [|n 2 - V(x)] . (1.96) 


Note, however, that the S[x(r)] defined here is formal; the variables x k and x^-i 
need not be close to each other and hence v = (x* — Xk-i)/e may diverge. This 
transition amplitude is written symbolically as 


<x/, f/|X/, ti) 


J Vx exp 
J Vx exp 


fit nn 9 

L d ' ( 2 " ■ 

- V (x)) 

ftf 

/ drL(x,x) 

'ti 



(1.97) 


which is called the path integral representation of the transition amplitude. It 
should be stressed again that the V is not well defined and that this expression is 
just a symbolic representation of the limit (1.95). 

The integration measure is understood as 


J Vx = summation over all paths x(t) withx(f,) = xj,x{tf) — x / (1.98) 



(*i> h) 


X 


Figure 1.2. All the paths with fixed endpoints are considered in the path integral. The 
integrand exp[i5({.x^})] is integrated over these paths. 


see figure 1.2. Although Dx or .S'({x/t}) is ill defined in the limit n — >■ oo, the 
amplitude (xj, tf\xi, tj) constructed from Vx and S {{xk }) together is well defined 
and hence meaningful. This point is clarified in the following example. 


Example 1.5. Let us work out the transition amplitude of a free particle moving 
on the real axis with the Lagrangian 


L — ^mx 2 . 


(1.99) 

The canonical conjugate momentum is p = dL/dx = mx and the Hamiltonian is 


H = px — L — — . 

2m 


( 1 . 100 ) 

The transition amplitude is calculated within the canonical quantum theory as 


(x f ,tf\xi,ti) = <x/|e lHT \ Xi ) = J dp(x f \e lH1 \p){p\xi) 


= f — e 1 

J 2 : r 


p(xf-Xi) -iT(p~/2m) 


2tcYT 


exp 


i m(xf — x,) 2 

2 ? 


(1.101) 


where T = tf — tj. 

This result is obtained using the path integral formalism next. The amplitude 
is expressed as 

/ m \ n / 2 [ 

{x f , t f \xi , tj) = hm — — / dx, . . . dx„_i 

n->o o \ 27tl£/ J 


exp 


' 

L k=\ 


m / Xk — Xk - 1 


(1.102) 


where e = T /n. After scaling the coordinates as 


171 \t/2 


yk 


(£) 


Xk 


the amplitude becomes 


/ m \«/2 / 2e V" 1),/2 
(x/, tf\xi, U ) = lim — — — 

«->o o\27ne/ \m / 


/' 


d>’i . . . d v/i-i exp 
It can be shown by induction (exercise) that 


r w 

i 

L k= 1 


_ w- 1) 2 


(1.103) 


/ 


dvi • ■ • dv„_i exp 


i J^(yk - yk- 1) 2 


1 

L k=\ 


- 

" (ur) (n-1) " 

- 

n 


1/2 


P i(rn-ro) 2 /« 


Taking the limit n -> 00 , we finally obtain 


/ m \n/t i7t\e\ 

{x f ,tf\xi,ti)= Inn — — 

n^oo \2jri£/ V /W / 


m \”/2 f2jtie\ {n 1 )// “ 1 ; 

~Jn 


im (x f —XiY/{2ne) 


2ttiT 


tttt exp 


i m(xf — x,') 2 
2T 


(1.104) 


It should be noted here that the exponent is the classical action. In fact, if we 
note that the average velocity is v = (x/ — x,-)/(f/ — ty), the classical action is 
found to be 


Set = 



dt—mv 

2 


m (xf - xj ) 2 
2(f/ — I;) 


It happens in many exactly solvable systems that the transition amplitude takes 
the form 


(xf, tf\xt, ti) = Ae lScl , 


(1.105) 


where all the effects of quantum fluctuation are taken into account in the prefactor 
A. 


1.3.2 Imaginary time and partition function 

Suppose the spectrum of a Hamiltonian H is bounded from below. Then it is 
always possible, by adding a postive constant to the Hamiltonian, to make H 
positive definite; 


Spec H — {0 < Eq < Ei < E 2 <■■■}■ 


(1.106) 



It has been assumed for simplicity that the ground state is not degenerate. The 
spectral decomposition of e~' Hl given by 

e-' Hr = J2 e ~' E "‘\ n )( n \ (1.107) 

n 

is analytic in the lower half-plane of t, where H\n) = E n \ n). Introduce the Wick 
rotation by the replacement 

t = —it (r e M+) (1.108) 

where IR+ is the set of positive real numbers. The variable r is regarded as 
imaginary time, which is also known as the Euclidean time since the world 
distance changes from t 2 — x 2 to — (r 2 + x 2 ). Physical quantities change under 
this change of variable as 


d.v . d.r 



Accordingly, the path integral is expressed in terms of the new variable as 

{x f , r f \xi, r i) = {x f \e~ lHz f~ Ti) \xi) 

= /me' /T MH£) 2+ H, (1.109) 

where V is the integration measure in the imaginary time r. 

For a given Hamiltonian H, the partition function is defined as 

Z(/3) = Tr e~P H (/3 > 0), (1.110) 

where the trace is over the Hilbert space associated with H . 

Let us take the eigenstates {IE,!)} of H as the basis vectors of the Hilbert 
space; 

H\E n ) = E n \E n ), ( E m \E n } = S mn . 

Then the partition function is expressed as 

Z(/l) = ^E n \eT p6 \E n ) = J2( E n\e~ PEn \En) 

n 

= 


d.iii) 



The partition function is also expressed in terms of the eigenvector |x) of x. 
Namely 

Z(ft) = Jdx{x \e~ pH \x). (1.112) 

If ft is identified with the Euclidean time by putting ft — iT , we find that 


{x f |e lHT \xj 


{x f |e pH \xi), 


from which we obtain the path integral expression of the partition function 


Z(f )= fdyf 

-/. ' 
J pen 


x(0)=x(P)=y 
Vx exp 


Vx exp 


f 


, 1 , 

dr ( -mx z + V(x) 


/ periodic 


f 


1 


dr I - mx ~ + V (x) 


where the integral in the last line is over all paths periodic in [0, ft]. 


(1.113) 


1.3.3 Time-ordered product and generating functional 

Define the T -product of Heisenberg operators A(t) and Bit) by 

T[A(ti)B(t 2 )] = A{tx)B{t2)d{h - t 2 ) + B(t 2 )A{t\)0(t 2 - ft) (1.114) 

9(t) being the Heaviside function. 13 Generalization to the case with more than 
three operators should be trivial; operators in the bracket are rearranged so that the 
time parameters decrease from the left to the right. The T -product of n operators 
is expanded into n ! terms, each of which is proportional to the product of n — 1 
Heaviside functions. An important quantity in quantum mechanics is the matrix 
element of the T -product, 

{Xf, tf\T[x{t\)x{tf) ■ • • x(t n )]\Xi , ti), o f; <t\,t2 Jr, < tf). (1.115) 

Suppose tj < t\ < t 2 < ■ ■ ■ < t n < tf in equation (1.115). By inserting the 
completeness relation 

/ OO 

dxk\xk, tk)(xk , t k | (k = 1 , 2 ,..., n) 

-OO 

into equation ( 1 . 1 1 5), we obtain 


{Xf, tf\x(t n ) ■ --xih)] Xi, ti) 

= (Xf , tf I x(tn ) J dx n \x n , t„ ) {. X„ Jn\--- x{h ) / dxi\x\, h){x\, t\\xi, tj) 

= J dxi . . ,dx„ X\ . . .x„(xf, t/\x n , t n ) ■ ■ ■ (*i, h\xi, ft) ( 1 . 116 ) 


13 The Heaviside function is defined by 


0(x) = 


0 x < 0 

1 x > 0. 



where use has been made of the eigenvalue equation x(tk)\xk, tk) = Xk\ Xk, tk). If 
(xk, tk\xk-i, tk- 1 ) in the last line is expressed in terms of a path integral, we find 

{xf, tf\x(t n ) ■ ■ -x(ti)\xi, ti) = J Vxx(ti) . . ,x(f„)e lS . (1.117) 

It is crucial to note that x(tk) in the LHS is a Heisenberg operator, while 
x(tk)(—Xk) in the RHS is the real value of a classical path x(t) at time tk- 
Accordingly, the RHS remains true for any ordering of the time parameters in 
the LHS as long as the Heisenberg operators are arranged in a way defined by the 
T -product. Thus, the path integral expression automatically takes the T -product 
ordering into account to yield 

(xf,tf\T[x(t n )...x(ti)]\xi,ti)= ( Vxx(ti) . . ,x(t n )e lS . (1.118) 


The reader is encouraged to verify this result explicitly for n = 2. 

It turns out to be convenient to define the generating functional Z\J ] to 
obtain the matrix elements of the T -products efficiently. We couple an external 
field J(t) (also called the source) with the coordinate x(t) as x(t)J(t ) in the 
Lagrangian, where J (t) is defined on the interval [ti, tf\. Define the action with 
the source as 


rtf 

S[x{t),J(t)] = J d t[\mx 2 -V{x) + xJ], (1.119) 

The transition amplitude in the presence of J(t) is then given by 

( Xf , tf\Xi, ti)j = I Vx exp f dt (^mx 2 — V(x) + x7)j. (1.120) 


The functional derivative of this equation with respect to J(t ) (r, < t < tf) yields 


S 

SJ(t) 


(Xf, tf\Xi , ti)j 


J Vxix(t)exp 



dt (^ mx 2 — V (x) + x J) 


( 1 . 121 ) 

Higher functional derivatives are easy to obtain; the factor i x(tk) appears in the 
integrand of the path integral each time S/SJ(t) acts on (x f , tf\x{, t,)j. This is 
nothing but the matrix element of the T -product of the Heisenberg operator x(t) 
in the presence of the source J(t). Accordingly, if we put .1(1) — 0 in the end of 
the calculation, we obtain 


(xf, t f \T [x(t n ) . . .x{t\)] | Xi, tj) 
S n 


= (-i)" 




/ 


Vx e : 


iS[A(f), ./(?)] 


( 1 . 122 ) 


7=0 


It often happens in physical applications that the transition probability 
amplitude between general states, in particular the ground states, is required 



rather than those between coordinate eigenstates. Suppose the system under 
consideration is in the ground state |0) at r,- and calculate the probability amplitude 
with which the system is also in the ground state at later time tf. Suppose 
J (t) is non-vanishing only on an interval [a, b] C [f, , tf]. (The reason for this 
assumption will become clear later.) The transition amplitude in the presence of 
J (t) may be obtained from the Hamiltonian H J — H — x(t)J(t ) and the unitary 
operator U J (tf,ti) of the Hamiltonian. The transition probability amplitude 
between the coordinate eigenstates is 

( Xf,tf\xi,ti)j = {xf\U J (tf, tj)\xi) 

= {x f \e.- lH{, f- b) U J (b,a)e- lHia - ,i) \x i ), (1.123) 

where use has been made of the fact H J = H outside the interval [ a, b ]. By 
inserting the completeness relations of the energy eigenvectors |n)(n| = 1 
into this equation, we obtain 

(xf, t f \xi , tj)j — y^ i (xf\e~ lH{t f~ b) \m)(m\U J (b, a)|n)(n|e _l//(fl_r,) |x;} 

m,n 

— e~ lE,n bf-b) Q -\E n (a-ti) j | X j)(m \U J (b, a)\n). 

m,n 

(1.124) 


Now let us Wick rotate the time variable t -> — ir under which the exponential 
function changes as s~' Et -> e~ Er . Then the limit r f -> oo , r,- — > — oo 
picks up only the ground states m = n — 0. Alternatively, we may introduce a 
small imaginary term — isx 2 in the Hamiltonian so that the eigenvalue has a small 
negative imaginary part. Then only the ground state survives in the summations 
over m and n under r f —¥■ oo, r,- — > — oo. 

After all we have proved that 


lim (xf,tf\xi,ti)j = (xf\0)(0\xi)Z[J] (1.125) 

tf—>0 O 
ti ~> — OO 

where we have dehned the generating functional 

Z[J] = (0\U J (b,a)\0) = lim (0\U J (t f ,ti)\0). (1.126) 

tf— >oo 
ti~>— 00 


The generating functional may be also expressed as 


Z[J] = 


lim 

t f — >oo 
ti — > — OO 


(■ Xf , tf\Xj, tj)j 

(jc/IOXOI*,-) 


(1.127) 


Note that the denominator is just a constant independent of Z[J], Now we have 
found the path integral representation for Z[J ], 

Z[J]=M f Vx e i5[v ’ /] 


(1.128) 



where the path integral is over paths with arbitrarily fixed x, and x-f. The 
normalization constant Af is chosen so that Z[0] = 1, namely 

AT 1 = j Vx e iS[ *’ 0] . 

It is readily shown that Z[J] generates the matrix elements of the '/'-product 
between the ground states: 


<0|T [x(t\) ■ ■ -x(t n )] |0> = (-i)" 


8 n 


SJ(ti) ■ ■ -SJ(t n ) 


Z[J] 


(1.129) 


7=0 


1.4 Harmonic oscillator 

We work out the path integral quantization of a harmonic oscillator, which is an 
example of systems for which the path integral may be evaluated exactly. We also 
introduce the zeta function regularization, which is a useful tool in many areas of 
theoretical physics. 


1.4.1 Transition amplitude 


The Lagrangian of a one-dimensional harmonic oscillator is 

L — jinx 2 — j murx 2 . (1.130) 

The transition amplitude is given by 

(x f , t f \ X j , ti) = j Vx e iS[ * W] , (1.131) 

where 5[.r (/)] = L d Z is the action. 

Let us expand S[.r] around its extremum x c (l) satisfying 


SS[x] 

Sx 


= 0 . 

X=X c (t) 


(1.132) 


Clearly x c (t) is the classical path connecting (x;, /,) and (x/, tf) and satifies the 
Euler-Lagrange equation 

x c + co 2 x c — 0. (1.133) 

The solution of equation (1.133) satifying x c (f,) = x,- and x c (tf ) — Xf is easily 
obtained as 


1 

x c (f) = [. Xf sin&)(r — tt) + X; sintti(/f — /)] (1.134) 

sin coT 


where T = tf — ti. Substituting this solution into the action, we obtain (exercise) 


S c = S[x c ] 

111 CO 


2 sin coT 


[(x 2 + xf) cos coT 


■ 2x/x,]. 


(1.135) 



Now the expansion of S [x ] around x — x c takes the form 


1 

S[x c + >’] = S[.r c ] + — 


I 


d?i dt 2 y(ti)y(t 2 ) 


3 2 ,S[x] 

8x(t\)8x(t 2 ) 


(1.136) 


where y(t ) satisfies the boundary condition y(r,) = y{tf) — 0. Note that (1) the 
first-order term vanishes since <5S[x]/Sx = 0 at x = x c and (2) terms of order 
three and higher do not exist since the action is second order in x. Therefore, this 
expansion is exact and this problem is exactly solvable as we see later. 

By noting that 


8 

8x(t\ ) 


f" d, 

— mx(t ) 2 mco 2 x(t) 2 


2 2 


— m — ^x(fi) — mco x(ti) 

dr. 


— m 



x(t\) 


and that 


8x(t\ ) 


— <5(B ~ t 2 ) 


Sx(t 2 ) 

we obtain the second-order functional derivative 


S 2 S[x] 


dr 2 


8x(t\)8x(t 2 ) 

Substituting this into equation (1.136) we find that 


+ co ) <5 (?i — t 2 ). 


S[x c + y] = S[x c ] 

= 5[x c ] + 


m r 

~2\j 

m [ 

ij 


(1.137) 


dri dt 2 y(h)y(t 2 ) 


drp 


+ co" ) S (fi — t 2 ) 


2 co 2 y 2 ). 


(1.138) 


where the boundary condition y(f, ) = y ( tf ) = 0 has been taken into account. 
Since Vx is translationally invariant, 14 we may replace Vx by Vy to obtain 


( Xf , tf\Xi, ti) 


— giSfAc] f Vy f!f dt(y 2 -a/y 2 ) 

Jydi)=y(tf)=o 


(1.139) 


Let us evaluate the fluctuation part 

If=[ Dy e 1 ^ fo dt(y 2 -0) 2 y 2 ) (1.140) 

Jy(0)=y(T)=0 

^Integrating over all possible paths x(t ) with x(t{) = Xf and x(tf) = Xf is equivalent to integrating 
over all possible paths y(t) with y(ti) = y(t f) = 0, where x(t ) = x c (t ) + y(t). 



where we have shifted the t variable so that t\ now becomes t — 0. We expand 
y(t) as 

E titc t 

a„ sin— (1.141) 

neN 

in conformity with the boundary condition. Substitution of this expansion into the 
integral in the exponent yields 



The Fourier transform from y(t) to {a n ) may be regarded as a change of variables 
in the integration. For this transformation to be well defined, the number of 
variables must be the same. Suppose the number of the time slice i s /V + I , 
including t — 0 and t — T, for which there are N — l independent y^. 
Correspondingly, we must put a n — 0 for n > N — 1. The Jacobian associated 
with this change of variables is 


Jn 


dy k 

= det — — = det 

del ii 


sin 



(1.142) 


where fit is the /:th time step when [0, T] is divided into N infinitesimal steps. 

This Jacobian can be evaluated most easily for a free particle. Since the 
transformation {y^l -> {«« 1 is independent of the potential, the Jacobian should 
be identical for both cases. The probability amplitude for a free particle has been 
obtained in (1.104) leading to 


(.</. r|*.o> = ' «p[i£(*/ -*) 2 ] = ' = isl * 


This is written in terms of a path integral as 


a iS[.Y c 


'/ 

JvO 


Vy e 1 ?^ df y 2 . 


' y(0)=y(T)=0 

By comparing these two expressions and noting that 

~T 


N 2 2 2 
a„nn 


m f 1 , . 7 ^ — \ az.n^n 

— / dt y z — > m > — 

2 Jo ^4T 

n = 1 


(1.143) 

(1.144) 


we arrive at the equality 

1 y / 2 


27riT 


■wT-f, 


y(0)=y(T)=0 


Vy^fJ^y 2 


/ 1 \l/2 n / N—\ 2^2 2 

/ 1 \ / 1 t / . citiTt n 

= lim J N — — / da \ . . . da^-i exp [un > — — 

N->oo \27T\8 ) J V , 47 

n = 1 



By carrying out the Gaussian integrals, it is found that 


Lf 2 nV— y /2 

Trip / A n \ tt2 I 


= lim Jn 

./V— >oo 


/ 1 \ w / 2 1 /47rir\ (Ar_1)/2 

~~ V27rieJ (At-l)!\7r 2 ) 


from which we finally obtain, for finite /V, that 

J N = N- N l 2 2- (N -"l 2 jt N -\N-\)\. 


(1.145) 


The Jacobian Jy clearly diverges as /V -y oo. This does not matter at all, 
however, since we are not interested in Jn on its own but a combination with 
other (divergent) factors. 

The transition amplitude of a harmonic oscillator is now given by 


(if, T\xi, 0} = lim Jn I 

At->oo V 27rie 


x / d«i . . . dflfv_i exp 


in T t t-~ 1 2 / I 17 T \2 1 

■—£>'' (t) 

L «=t ’ 


(1.146) 

The integrals over a n are simple Gaussian integrals and easily carried out to yield 


d a n exp 


i mT 2 /«JT \ 2 
4 Q n 


21-V2 


By substituting this result into equation (1.146), we obtain 


(x f , t f \xi, ti) = lim J N — — 

N- s-oo \Z7Tll 


fl[K? 


1/2 At- 1 

„iS[i c ] TT 


/2~| At— i r / T 

n [ -(- 

J n= 1 L V 

(V-l r / \ 2 ” 

n i-(— ) 

n= I L Wj 


The infinite product over n is well known and reduces to 


(1.147) 


lim 1 

N^oo 1 1 

n= 1 _ 


si 


sin&>r 


(1.148) 



Note that the divergence of J v cancelled with the divergence of the other terms 
to yield a finite value. Finally we have shown that 


{Xf, tf\Xi, tj) = ( 

-( 


CO 


\l/2 


2tti sin coT ) 

CO \l/2 

27ri sino^r / 




exp 


1CD 


, . „{(x f +Xf) coscoT -2xiXf}\ 

2 sin coT J J 

(1.149) 


1.4.2 Partition function 

The partition function of a harmonic oscillator is easily obtained from the 
eigenvalue E n = (n + 1 /2 )&>, 


OO 

Xre -/8ff = ^ e -/3(« + l/2)« 
n = 0 


i 

2sinh(/l<w/2) 


(1.150) 


The inverse temperature ft can be regarded as the imaginary time by putting 
\T — p. Then the partition function may be evaluated from the path integral 
point of view. 

Method 1: The trace may be taken over {| jc)} to yield 


/■ 


/■ 


e-^| 

CO 

x) 

_V' 2 

i sinh pco) ) 

exp i 

CO 

— 2i sinh ftco 


( " )‘ /2 

IX 

\2tt sinh fdu>) 

a tanh(/j&)/2) 


( 2x 2 cosh /3&> — 2x 2 ) 

1/2 


1 


2sinh(/)&>/2) 


(1.151) 


where use has been made of equation ( 1 . 149). 

The following exercise serves as a preliminary to Method 2. 


Exercise 1.5. (1) Let A be a symmetric positive-definite n x n matrix. Show that 
J dx] . . . dx„ exp ^ ^ XiAijXj^J = 7r"/ 2 (det A) -1 / 2 = j r"/ 2 ]""[ k j 


(1.152) 



where Xj is the eigenvalue of A. 

(2) Let A be a positive-definite n x n Hermite matrix. Show that 


J dzi dzi . . . dz, n dz„ exp 



7r"(det A) -1 = ;r' ! ]“[ A7 1 . 

i 

(1.153) 


Method 2: We next obtain the partition function by evaluating the path 
integral over the fluctuations with the help of the functional determinant and the 
£ -function regularization. We introduce the imaginary time r — it and rewrite 
the path integral as 


L 


Vy exp 


y(0)=y(T)=0 


l 


Vy exp 


v( 0)=y(P)=0 




y 




y 


where we noted the boundary condition y(0) = y(fi) = 0. Here the bar on T> 
implies the path integration measure with imaginary time. 

Let A be an n x n Hermitian matrix with positive-definite eigenvalues 
Xk (1 < k < n). Then for real variables x±, we obtain from exercise 1.5 that 


fl ( f°° djc*) e-tep**’*"** = fl-L 

k=\ 00 ' k= 1 


1 

Vdet A 


where we neglected numerical factors. This is a generalization of the well-known 
Gaussian integral 

r °° _ 1 1..2 fin 

/ d.xe =J — 

J —oo ' ^ 

for X > 0. We define the determinant of an operator O by the (properly 
regularized) infinite product of its eigenvalues Xk as Det O = ru Xk. i5 Then 
the previous path integral is written as 


f 

'If ( d 2 2 \ ' 

I Vyexp 

.-2 y dr v^ + "h 

Jy( 0)= y (/3)=0 


1 

^/DetDl— d 2 /dr 2 + co 2 ) 


(1.154) 

where the subscript ‘D’ implies that the eigenvalues are evaluated with the 
Dirichlet boundary condition y(0) = y(j3) = 0. 

The general solution y (r) satisfying the boundary condition is written as 


ne N 


mix 


y n sin ■ 


(1.155) 


15 We will use ‘det’ for the determinant of a finite dimensional matrix while ‘Det’ for the (formal) 
determinant of an operator throughout this book. Similarly, the trace of a finite-dimensional matrix is 
denoted ‘tr’ while that of an operator is denoted ‘Tr’ . 



Note that y n e K since y(r) is a real function. Since the eigenvalue of the 
eigenfunction sin(«7rr//3) is X n = (nn/ ft) 2 + or, the functional determinant 
is formally written as 


Det D 



= FL 

n = 1 




(1.156) 


The first infinite product in the last line is written as 



We will evaluate this infinite product through the f -function regularization. Let 
O be an operator with positive-definite eigenvalues X n . Then we have formally 

log Det (D = Tr log (D — log X n . (1.157) 

n 

Now we define the spectral ((-function as 

&>(*) = (1-158) 

n 


The RHS converges for sufficiently large Re .v and Co (- s ) is analytic with respect 
to ,v in this region. Moreover, it can be analytically continued to the whole s-plane 
except at a possible finite number of points. By noting that 


dfoCs) 


ds 


5=0 


^logA.„ 

n 


we arrive at the expression 

Det O = exp 

We replace O by — d 2 /dr 2 in the case at hand to find 


dCo(i) 



j=0- 


E ( flit 

I o 

n> 1 s P 


C(2 S) 


(1.159) 


(1.160) 


where f(2 s) is the celebrated Riemann ^-function. It is analytic over the whole 
,? -plane except at the simple pole at s = 1. From the well-known values 

m = -k f'(0) = -|log(27T) 


(1.161) 



we obtain 


O/dr^°) = 2l0 g 



f(O) + 2?'(O) = -log(20). 


We have finally shown that 


Det D 



= e log(2/3) = 2yS 


and that 



OO 

1 


p = i 



(1.162) 


(1.163) 


The infinite product in this equation is well known but let us pretend that we are 
ignorant about this product. 

The partition function is now expressed as 


Tre-C" = 


■ P= 1 


Pn \ 2 ^~ 1 ' 2 
pn 


it 


<wtanh(/J&>/2) 


1/2 


(1.164) 


By comparing this with the result (1.151), we have proved the formula 


namely 


n 

n = 1 


fico 

nn 


TT 

— sinh(/3&>) 
fico 



sinh(7rx) 

7 XX 


(1.165) 


What about the infinite product expansion of the cosh function? This is given 
by using the path integral with respect to the fermion, which we will work out in 
the next section. 


1.5 Path integral quantization of a Fermi particle 

The particles observed in Nature are not necessarily Bose particles whose position 
and momentum operators obey the commutation relation [p, x] = — i. There are 
particles called fermions whose operators satisfy anti-commutation relations. A 
classical description of a fermion requires anti-commuting numbers called the 

Grassmann numbers. 



1.5.1 Fermionic harmonic oscillator 


The bosonic harmonic oscillator in the previous section is described by the 
Hamiltonian 16 

H = \(a' a + aa') 
where a and a ' satisfy the commutation relations 

[a, a^] =1 [a, a] = [a ^ a + ] = 0. 

The Hamiltonian has eigenvalues (n + 1/2 )co (n e N) with the eigenvector | n): 

H\n) — ( n + j)co\n). 

Now suppose there is a Hamiltonian 

H = \{c^c — cc*)co. (1.166) 

This is called the fermionic harmonic oscillator, which may be regarded as 
a Fourier component of the Dirac Hamiltonian, which describes relativistic 
fermions. If the operators c and c 6 should satisfy the same commutation relations 
as those satisfied by bosons, the Hamiltonian would be a constant H = —col 2. 
Suppose, in contrast, they satisfy the anti-commutation relations 

{c, c 1 } = cc f + c^c — 1 {c, c} = {c 1 / c^} = 0. (1.167) 

The Hamiltonian takes the form 

H = i[c f c - (1 - cJ)]co = (N - i)cu (1.168) 

where N — c 'c. It is easy to see that the eigenvalue of N must be either 0 or 1. 
In fact, N satisfies N 2 = c^cc^c = N, namely N(N — 1 ) = 0. This is nothing 
other than the Pauli principle. 

Let us study the Hilbert space of the Hamiltonian H. Let | n) be an 
eigenvector of H with the eigenvalue n, where n = 0, 1 as shown earlier. It 
is easy to verify the following relations; 

H | 0 } = — 1 | 0 > #| 1 > = || 1 > 

c + |0) = 1 1) c|0> = 0 c f |l) = 0 c|l) = |0>. 

It is convenient to introduce the component expressions 



16 We will drop ~ on operators from now on unless this may cause confusion. 



Exercise 1.6. Suppose the basis vectors have this form. Show that the operators 
have the following matrix representations 



The commutation relation [x, p] = i for a boson has been replaced by 
[x, p] = 0 in the path integral formalism of a boson. For a fermion, the anti- 
commutation relation {c, c^} = 1 should be replaced by {0, 0*} — 0, where 6 and 
0* are anti-commuting classical numbers called Grassmann numbers. 


1.5.2 Calculus of Grassmann numbers 

To distinguish anti-commuting Grassmann numbers from commuting real and 
complex numbers, the latter will be called the ‘c-number’, where c stands for 
commuting. Let n generators {0j, . . . , 0,,} satisfy the anti-commutation relations 

{di,dj} = 0 Vi, j. (1.169) 

Then the set of the linear combinations of {0, } with the c-number coefficients is 
called the Grassmann number and the algebra generated by {0, } is called the 
Grassmann algebra, denoted by A" . An arbitrary element / of A" is expanded 
as 

n 

m = fo + £ + £ fuMj + ■■■ 

i = 1 i< j 

- E ^Ea..../A--a. d- 17 o) 

0 <k<n ' {/} 

where fo, f , fj , . . . and are c-numbers that are anti-symmetric under the 

exchange of any two indices. The element / is also written as 

m= £ jl (1.17D 

k =0.1 


Take n = 2 for example. Then 

/ (^) — fo + f\0\ + fi 02 + fnO \02 
— foo + fioOi + foi 02 + fnO i 02 - 

The subset of ), n which is generated by monomials of even (resp. odd) power in 
9k is denoted by A" (A" ): 

A" = A^ © A” . 


(1.172) 



The separation of A" into these two subspaces is called Z 2 -grading. We call an 
element of A" (A") G-even (G-odd). Note that dim A" = 2" while dim A" = 
dim A'i = 

The generator Ok does not have a magnitude and hence the set of Grassmann 
numbers is not an ordered set. Zero is the only number that is a c-number as well 
as a Grassmann number simultaneously. A Grassmann number commutes with a 
c-number. It should be clear that the generators satisfy the following relations: 

** = o 

0ki0k 2 ■ ■ ■ 0 K — £kik 2 ...k n 0l62 ...0 n (1.173) 

6k x 0k 2 ---0k m =Q (m > n), 


where 


s ki...k„ 


+ 1 if [k \ ... k n } is an even permutation of { 1 ... n] 
— 1 if {k \ . . . k n } is an odd permutation of { 1 . . . n} 
0 otherwise. 


A function of Grassmann numbers is defined as a Taylor expansion of the 
function. When n — I . for example, we have 

e 0 = 1 + 6 


since higher-order terms in 6 vanish identically. 


1.5.3 Differentiation 

It is assumed that the differential operator acts on a function from the left: 

d6: 3 


d6j dOi 


= —Oj = Sij. 


(1.174) 


It is also assumed that the differential operator anti-commutes with 6k- The 
Leibnitz rule then takes the form 


2- ( W = a A ek - $IJ ji = SlA - s, t ej. 


'3 r „ d0 k 


Exercise 1.7. Show that 


3 3 3 3 

WiWj + WjWi ~ 


(1.175) 


(1.176) 


It is easily shown from this exercise that the differential operator is nilpotent 


3 6: 


= 0. 


(1.177) 


Exercise 1.8. Show that 


3 


3 



1.5.4 Integration 


Supprisingly enough, integration with respect to a Grassmann variable is 
equivalent to differentiation. Let D denote differentiation with respect to a 
Grassmann variable and let I denote integration, where integration is understood 
as a definite integral. Suppose they satisfy the relations 

(1) ID = 0, 

(2) DI = 0, 

(3) D{A) = 0 =£> I{BA) = I(B)A, 


where A and B are arbitrary functions of Grassmann variables. The first relation 
states that the integration of a derivative of any function yields the surface term 
and it is set to zero. The second relation states that a derivative of a definite 
integral vanishes. The third relation implies that A is a constant if D(A ) = 0 and 
hence it can be taken out of the integral. These relations are satified if we take 
/ oc D. Here we adopt the normalization I = D and put 

f df(0 ) 

Jd0f(0) = ^^. (1.179) 

We find from the previous definition that 



If there are n generators { Ok 1 , equation (1.179) is generalized as 

/ d d 3 

d01 d02 . ■ .d On f (01, 02, ... , 9n) = • ■ • — m, 02,..., 9n). 

ou\ OU2 3c *n 

(1.180) 

Note the order of d Ok and 3/3 Ok. 

The equivalence of differentiation and integration leads to an odd behaviour 
of integration under the change of integration variables. Let us consider the case 
n — 1 first. Under the change of variable O' — aO (a e Q, we obtain 


3/(9) 3/(9'/a) 

] de f,e> = — = =“ J ie f(0 /“) 



which leads to d O' — (\/a)dO. This is readily extended to the case of n variables. 
Let 6i — >■ 0- = ciij Oj . Then 


/ 


ddi...9„m = 4----4-/W 

Ou\ 


A ML d9 k 3 3 , , 

^Mi d0nde' kl ML 

A a 3 , 

— / . e *l - bi 0 *i 1 ■ ■ - a k n n TTT ■ • ■ tT/v - / ( fl ® ) 
fc=l 30 *i ^ 


= detfl 




Accordingly, the integral measure transforms as 

d&i d 02 ■ ■ ■ 6„ — det a d0j d6'~, . . . d0' n . 


(1.181) 


1.5.5 Delta-function 

The 5-function of a Grassmann variable is introduced as 

j d0 8(0-a)f(0) = f(a) (1.182) 

for a single variable. If we substitute the expansion /(0) — a + bO into this 
definition, we obtain 

j dO 8(0 — a)(a + bO) = a + ba 
from which we find that the 5-function is explicitly given by 


5(0 -a) = 0-0'. (1.183) 

Extension of this result to n variables is easily verified to be (note the order of 
variables) 

8 n (0 -a)= (On - oc n ) ... (02 - a 2 )(0t - «i). (1.184) 

The integral form of the 5-function is obtained from 

J d£e i?0 = J d£ (1 + i£0) =i0 


as 


5(0) = 0 = -i / d£e ife . 


(1.185) 



1.5.6 Gaussian integral 

Let us consider the integral 


■/ 


/ = / d0,* d6\ . . . d0; 




(1.186) 


where {0, } and {0*} are two sets of independent Grassmann variables. The n x n 
c-number matrix M is taken to be anti-symmetric since 0, and Of anti-commute. 
The integral is evaluated with the help of the change of variables Oj — JU MjjOj 
as 


I = 


det M J d0* d0[ . . . d0* d0' 


e _ T,i e * e 'i 


[/• 


= det Ml / d0* d0 (1 + 0'0*) 
= det M. 


(1.187) 

We prove an interesting formula as an application of the Gaussian integral. 


Proposition 1.3. Let a be an anti-symmetric matrix of order 2 n and define the 

Pfaffian of a by 


pf( «) = s Sn( p ) a hi 2 


* l 2 n—\ l 2 n * 


Permutations of 

1*T *2j»1 


(1.188) 


Then 


det a = Pf(a)" 


(1.189) 


Proof. Observe that 

I = J d 02 „ . . . d0i exp 

= Pf(fl). 

Note also that 


2 0< ai j0j ~ 2>i n \ J 2n • ' • ^1 ^ 0' a ‘j0j^j 


I 1 = j d0 2n . . . d0i d 02 „ . . . d0i exp i ^(0,a !; -0 ; - + 0-a,y0') 


Under the change of variables 


m = + of). 4 = — 7=7(0* - 0;), 



we obtain the Jacobian = (—1)" and 


\9 j +9' i 9' j = r, i r,*-r,*r, i 


d mn - . . d m Arj* n . . . di/J = (-1)" din dr,*... d r, 2 „ d rj* 2n , 


from which we verify that 


Pf(a) 2 = J dm dr,*... d r, 2n d r, 2 „ ex P \ X! ^* a (/ Vj = 

L ij J 


= detfl. 


□ 


Exercise 1.9. (1) Let M be a skewsymmetric matrix and K, be Grassmann 
numbers. Show that 

J d0\. . . d e n t -\'eM-e+<K-e = 2 «/2 V detM e -‘k-m-'-k/a (I. 190 ) 

(2) Let M be a skew-Hermitian matrix and K, and K* be Grassmann numbers. 
Show that 

J d9fdOi...dO*dOne~ 0t ' M ' e+Kt ' 9+et ' K = detM e Kt ' M ~ lK . (1.191) 


1.5.7 Functional derivative 

The functional derivative with respect to a Grassmann variable can be defined 
similarly to that for a commuting variable. Let be a Grassmann variable 
depending on a c-number parameter t and F[\j/ (f)] be a functional of 1 Jr. Then we 
define 

SF[\lr(t)] 1 

. . . . = + eS(t - 5 )] - JWf)]}, (1.192) 

01 jr(s) e 

where e is a Grassmann parameter. The Taylor expansion of F[i/f(r) — eS(t — 5)] 
with respect to e is linear in s since e 2 = 0. Accordingly, the limit s —> 0 is 
not necessary. A word of caution: division by a Grassmann number is not well 
defined in general. Here, however, the numerator is proportional to e and division 
by e simply means picking up the coefficient of e in the numerator. 

1.5.8 Complex conjugation 

Let {$,•} and ( 0* } be two sets of the generators of Grassmann numbers. Define the 
complex conjugation of 9j by ($,)* = 0* and (0*)* = 6i. We define 

(9i9j)* — 9*9*. (1.193) 

Otherwise, the real c-number 0, 0* does not satisify the reality condition 

0 9i9f)* = 9i9 *. 



1.5.9 Coherent states and completeness relation 

The fermion annihilation and creation operators c and c ^ satisfy the anti- 
commutation relations {c, c} = {c^, c^} — 0 and {c, c^} = 1 and the number 
operator N = c^c has the eigenvectors |0) and |1>. Let us consider the Hilbert 
space spanned by these vectors 

T-L — Span{ 1 0) , 1 1 } } . 

An arbitrary vector | /} in T-L may be written in the form 

I/) = |0)/o + I l)/i, 


where /q, f\ e C. 

Now we consider the states 

|0> = |0> + 1 1)0 (1.194) 

<0| = (O|+0*<1| (1.195) 

where 9 and 9* are Grassmann numbers. These states are called the coherent 
states and are eigenstates of c and c ’ respectively, 

c\9) = |O}0 = 1 9)9, (0|c f = 0*<O| = 0*(0|. 

Exercise 1.10. Verify the following identities; 

(0'|0> = 1 +9'*9 = e 9 '* 9 , 

(0\f) = ft + 0* ft, 

(0|c f |/> = (0|l)/o = 0*fo = 0*(0|/>, 

(0|d/> = <0|O)/i = 

Let 

/z(c, c f ) = /zoo + h\oc^ + /zoic + hnc^c h,j e C 
be an arbitrary function of c and L. Then the matrix elements of h are 

(0|/z|0> = /zoo (0|/z|l> = /z 01 (l|/z|0> = h 10 (l|/z|l) = hoo + h n . 

It is easily found from these matrix elements that 

(0|/z|0'> = + Q*h w + h w 9’ + 9*6' h n )e e * e ' . (1.196) 

Lemma 1.3. Let |0) and (0 1 be defined as before. Then the completeness relation 
takes the form 

J d9*dd\9){9\e- e * d 


= /. 


(1.197) 



Proof. Straightforward calculation yields 


J d6>*d<9|(9>(<9|e“ 0 *' 


= J d<9*d<9(|0} + |1)0)«O| +6>*(1|)(1 -0*0) 

= J d6* dO (|0)(0| + 1 1>0<O| + |O)0*(1| + 1 1)00* (1 1) (1 — 9*6) 
= | 0 )( 0 | + | 1 >< 1 | = /. 


□ 


1.5.10 Partition function of a fermionic oscillator 

We obtain here the partition fuction of a fermionic harmonic oscillator as an 
application of the path integral formalism of fermions. The Hamiltonian is 
H = (c'c — 1/2 )co, which has eigenvalues ±<w/2. The partition function is then 

l 

Z(P) = Tr e,~P H = \n) = e^ /2 + e“^ w/2 = 2cosh( / Sw/2). 

n= 0 

(1.198) 

Now we evaluate Z (ft ) in two different ways using a path integral. We start our 
exposition with the following lemma. 

Lemma 1.4. Let H be the Hamiltonian of a fermionic harmonic oscillator. Then 
the partition function is written as 

Tr e~P H — J dO* d6(—0\&~^ H \0)6~ e * e . (1.199) 

Proof. Let us insert the completeness relation (1.197) into the definition of a 
partition function to obtain 

Z(f i)= ^ (n\&~P H \n) 

n= 0,1 

d6* dde~ e * e {n\0)(0\e~P H \n) 

dd* dO (1 — 6>*6>)(</i|0> + (n\l)0)((0\e~P H \n) + 0* \n)) 

= I d9*dd(l-6*0)[(0\e-P H \n){n\0) 

n ^ 

- 6*0(l\e~P H \n)(n\ 1) + 6>{0|e“^|n><n|l> + 6>*<l|e“^|«><n|0>]. 


= E/ 

n J 

= £/ 

n J 



The last term of the last line does not contribute to the integral and hence we may 
change 0* to — 0*. Then 


= J d0*d0(l — 6*0)[(0\e~P H \n)(n\0) 

n * 

-0*0{l\e~ pH \n){n\l) + 0(O|e“^ ff |n)(n|l) - 0*(l|e _ ^|n)(n|O>] 

= J d0*d0e~ e * 8 (-0\e~ pH \0). □ 


Accordingly, the coordinate in the trace is over anti-periodic orbits. The 
Grassmann variable is 9 at r = 0 while — 0 at r = p and we have to impose an 
anti-periodic boundary condition over [0, /l] in the trace. 

Use the expression 


e~P H = lim (1 -0H/N) 

N^-oo 


N 


and insert the completeness relation at each time step to find 


Z(P) = lim / d0* d0 e" 
N—>oo , 


— lim 

N-+oo 


./■ 

/ " n / de* k 


(- 0 |( 1 - PH/Nf\6) 


d Ok e 


E N— 1 n*n 
n = 1 d n d n 


X (-01(1 - sH)\6n-i){6n-i I • • - 101>(01|(1 - bH) |0> 

N 


lim [ FT d °k e“ 8 " 8n 

N^ooJ 1 L 
J k— I 


x (0*1(1 - eff)|0*_j>(0*-i| • ■ ■ |6»t><6»i|(l - bH ) | - 0*> 


where we have put e = /3/iV and 0 = —6n = 0o, 0* = —0^, = 0 q. 
Each matrix element is evaluated as 


(0*|(1 -eff)|0*_t) = (0*|0*_i> 


^ _ (0k\H\6k-i) 

(0*|0*-i> 


~ (0 jt |0 l( ._ 1 >e _£<e * |H|ft - 1>/<ftlft - 1> 

= e ^0*-i e -£®(0^-l-l/2) 



The partition function is now expressed in terms of the path integral as 


Z(/3) = lim e^ /2 fl f dO? d6 k e -£»=i^"e (1 -«») £,=i e,A-i 
N^o o 1 1 J 

k=\ J 

N r 

= e f}w/1 lim Ff / d 9 f d 9 k e~^n=M( 0 n-e n -i)+e° 1 e*e n -i} 

N^oo 1 L J k 
k= 1 J 

iV 

_ g^"/ 2 lim rr / d6»?d^e _0t ' B ' e , (1.200) 

iV^oo 1 1 J 


where 


/ 01 \ 

&2 


6> t = 


\ On / 

( 1 0 ... 0 ->■ \ 

>- 1 0 ...0 

0 >■ 1 ...0 


Bn = 


\ 0 0 .. 


1 ) 


with y = — 1 + sco in the last line. We finally find from the definition of the 
Gaussian integral of Grassmann numbers that 


Z(/l) = e^®/ 2 lim detBiv = e^ /2 lim [1 + (1 - Pco/N) n ] 
iV— >o o N—>oo 

= e^®/ 2 (l + e~^ M ) = 2cosh A/l&>. 


( 1 . 201 ) 


This should be compared with the partition function (1.151) of the bosonic 
harmonic oscillator. 

This partition function is also obtained by making use of the f -function 
regularization. It follows from the second line of equation (1.200) that 


Z(P) = e^" /2 lim 

N-*oo 



d o* d 0 k t~ l(i — Wn-d„-i)ls+u>e*en\ 


= e ^/ 2 
= e^/ 2 


J Vd*V0ex ] p - 


dr 9* 


DetAPBC ( 1 — sco) - — h u> 
dr 


(1 - sco)— + co 

dr 


Here the subscript APBC implies that the eigenvalue should be evaluated for the 
solutions that satisfy the anti-periodic boundary condition 9 (ft) = —9(0). It 



might seem odd that the differential operator contains e. We find later that this 
gives a finite contribution to the infinite product of eigenvalues. Let us expand 
the orbit 6(r) in the Fourier modes. The eigenmodes and the corresponding 
eigenvalues are 


( icU2n + l)r \ 

exp V 1 j ’ 


(1 — eco) 


n\(2n + 1) 

1 


+ co, 


where n = 0, ±1,±2, .... It should be noted that the coherent states are 
overcomplete and that the actual number of degrees of freedom is N , which is 
related to e as s — fi/N. Then we have to truncate the product at — N/4 < k < 
N /4 since one complex variable has two real degrees of freedom. Accordingly, 
the partition function takes the form 


N/4 


Z(B) = e^/ 2 lim FI 

o 1 L 


k=-N/4 
oo 


n(2n — 1) 

i(l — eco) h co 


— e ^®/2g-^w/ 2 

k=l 

°° r 7t(2k — l) n2 °° 


(2n(n- 1/2) V , 2 


= n 


k= 1 


n 


- 1 n= 1 L 


1 + 


j}<o 


it(2n — 1) 


The first infinite product, which we call P, is divergent and requires 
regularization. Note, first, that 


OO 

logP = £2 log 
k= 1 


2jt(k- 1/2) 
?> 


Define the corresponding ^-function by 

" 2ic (k — 1 /2) 




k= 1 


with which we obtain P — e . Here 

1 




k = o 


(k + a) s 



Us, 1 / 2 ) 


(0 < a < 1) 


( 1 . 202 ) 


is the generalized ^-function (the Hurwitz f -function). The derivative of / (.v) 
at s — 0 yields 


l’( 0) - log 


2tt 


f (0, 1/2) + %'(0, 1/2) = — — log 2, 



where use has been made of the values 


Finally we obtain 


4T(0, 1/2) = 0 f r (0, 1/2) = — 2 log 2. 


P = e -2 f' (0) = e log2 = 2. 


(1.203) 


Note that P is independent of f J > after regularization. 

Putting them all together, we arrive at the partition function 


zm = 2 


fia> 

7r(2« — 1) 


(1.204) 


By making use of the well-known formula 


cosh - = 
2 


OO 

=n 1 

n= 1 _ 


n 2 (2n — l) 2 


we obtain 


Bco 

Z(P) — 2 cosh 


(1.205) 


(1.206) 


Suppose, alternatively, we are ignorant about the formula (1.205). Then, 
by equating equation (1.201) with equation (1.204), we have proved the formula 
(1.205) with the help of path integrals. This is a typical application of physics 
to mathematics: evaluate some physical quantity by two different methods 
and equate the results. Then we often obtain a non-trivial relation which is 
mathematically useful. 

1.6 Quantization of a scalar field 


1.6.1 Free scalar field 

The analysis made in the previous sections may be easily generalized to a case 
with many degrees of freedom. We are interested, in particular, in a system with 
infinitely many degrees of freedom; the quantum field theory (QFT). Let us 
start our exposition with the simplest case, that is, the scalar field theory. Let 
<t>(x) be a real scalar field at the spacetime coordinates x = (x, x°) where x is the 
space coordinate while x° is the time coordinate. The action depends on r/j and its 
derivatives 9 |U 0(x) = 30U)/3.r /i : 


J d* £(</>, 9 ,_,<!)). 


(1.207) 


17 The first formula follows from the relation £(s, 1/2) = ( 2 s — l)£(s), which is derived from 
the identity £(s, 1/2) + £(s) = 2 s J^^i[l/(2w — l) 1 * + l/(2n) s ] = 2 S t;(s). The second formula 
is obtained by differentiating £(s, 1/2) = ( 2 s — 1)^(5) with respect to .9 and using the formula 
£(0) = —1/2. 



Here C is the Lagrangian density. The Euler-Lagrange equation now takes the 
form 


3 / d£ \ d£ 

dx^ \ 3 (3/u0) / 30 

The Lagrangian density of a free scalar field is 


(1.208) 


£o(0, 3„0) = -1(3^03^0 + m 2 0 2 ). (1.209) 


The Euler-Lagrange equation derived from this Lagrangian density is the Klein- 
Gordon equation 

(□-/« 2 )0 = O, (1.210) 

where □ = 3 M 3 M = — 3 q + V 2 . 

The vacuum-to-vacuum amplitude in the presence of a source J has the path 
integral representation (0, oo|0, —oo) J oc Zq[ 7], where 


Z 0 [/] = 


J Vcp exp 



£q + /0 + — £0 



( 1 . 211 ) 


where the ie term has been added to regularize the path integral. 18 Integration by 
parts yields 


-f 


Zq[J] = / D0exp 


i J dx (j{cp(\J — m 2 )<p + i £0 2 } + J(f>) 


( 1 . 212 ) 


Let 0 C be the classical solution to the Klein-Gordon equation in the presence 
of the source, 


(□ — m~ + ie)0 c = — J. 
The solution is easily found to be 

0 c (x) = - J dy A(.r - y)J(y) 

where A(x — y) is the Leynman propagator 


A(x - y) = 


bs 


Jk(x-y) 


A d k ■ 


(2n) d J k 2 + m 2 — ie 
Here d denotes the spacetime dimension. Note that A (x — y) satisfies 
(□ — in 2 + ie)ACv — y) — S d (x — y). 

It is easy to show that (exercise) the functional Zq[ 7] is now written as 


(1.213) 


(1.214) 


(1.215) 


Z 0 [7] = Z 0 [0] exp 


^ J dx dy J(x)A (x - y)J(y) 


(1.216) 


^Alternatively, we can introduce the imaginary time r = ix® to Wick rotate the time axis. 



It is instructive to note that the propagator is conversely obtained by the functional 
derivative of Zq[7], 


A(r - y ) = 


i S 2 Z 0 [J] 
Zq[ 0] SJ(x)SJ(y ) 


7=0 


(1.217) 


The amplitude Zo[0] is the vacuum- to- vacuum amplitude in the absence of 
the source and may be evaluated as follows. Let us introduce the imaginary time 
x 4 — r — i.Y°. Then, we obtain 


Zq[0] = J V<p exp \ J 


dx </>(□ — m 2 )cp 


= [Det(D - m )] 


2^1 1/2 


(1.218) 


where □ = d 2 + V 2 and the deteminant is understood in the sense of section 1 .4, 
namely it is the product of eigenvalues with a relevant boundary condition. 

A free complex scalar field theory has a Lagrangian density 

Co = - m 2 1(/>| 2 + J<(>* + J*<j> (1.219) 


where the source terms have been included. The generating functional is now 
given by 


Z 0 [J, J*] = J V<pV<p* exp 
= J V<pV<p* exp 

The propagator is now given by 
A ( a ; - V ) = 


i / dx (Co ~ ism) 


'■/ 

:*/■ 


i / dr {</> (□ — m~ + ie)</> + J (p + J(p } 

(1.220) 


i S 2 Zq[J, 7*] 


7 = 7*=0 


Z 0 [0, 0] 8J*(x)8J(y) \ 

By substituting the Klein-Gordon equations 

(U-m 2 )<p c = -J (□ -m 2 )(p* = -J* 
we separate the generating functional as 


Zq[7, /*] = Zq[ 0, 0] exp 


-/ 


dr d>> 7 * (r ) A (r — y)J(y) 


( 1 . 221 ) 


( 1 . 222 ) 


(1.223) 


where 


Zq[0, 0] — J V(pV(p*ex p — i J 


= [Det(D — m 2 )] -1 . 


dxcp*(n — nr — i s)(p 


(1.224) 


Wick rotation has been made to occur at the last line. 



1.6.2 Interacting scalar field 

It is possible to add interaction terms to the free field Lagrangian (1.209), 


£(0, 9 n</>) = £o(0, 9^0) - V ((p). 


(1.225) 


The possible form of V (<f>) is restricted by the symmetry and renormalizability of 
the theory. A typical form of V is a polynomial 


v( 4 >) = V' 

n\ 


(n > 3 , n € 


where the constant gel controls the strength of the interaction. The generating 
functional is defined similarly to the free theory as 


Z[J] = 


= J V(j) exp i J dx { ?</>(□ — m 2 )<p — V (<p) + Jcp} 


(1.226) 


The presence of V (0) makes things slightly more complicated. It can be handled 
at least perturbatively as 


Z[J] = / Vtp exp T— i j 


-i dx V (0) 


i / dx {£o + 


= exp 
= exp 


-/ 


i / dxV 


1 


i SJ(x ) 


H 

-±h~J 


, 1 5 

dr V — 


dxk 


i SJ(x) 

(~i) k 


:■/ 

J Vcp exp i J 
Z 0 [J] 


i / dx {£q + /(/>} 


*=o 
x V 


k! 


1 


i <57(xi) 


...V 


1 


i SJ(xk) 


ZoUl 


(1.227) 


The generating functional Z[J] generates the vacuum expectation value 
of the T -product of field operators, also known as the Green function 

G„(xi, . . . , x„), as 


G„(x i ,x n ) = (O|7’[0(xi) . . .<j)(x n )]\0) 

( -i) n S n 


SJ(x i) . . .SJ(x n ) 


Z[J] 


(1.228) 


7=0 


Since this is the «th functional derivative of Z[J] around ./ = 0, we obtain the 
functional Taylor expansion of Z[J ] as 


oo i r » . 

z[j] = x! ~ n / dx ' j(x ’) 

= (0|7’e^ dx ' /(x) '^’ (x) |0). 


{{)\T[<p(x \) . . .</>(x„)]|0) 


(1.229) 



The connected n-point functions are generated by W \ J j defined by 


Z[J] = t~ wm . 

The effective action l'[f/; c |] is defined by the Legendre transformation 


where 


T [M s w[j] 


<t> cl = {<P) J = 


/ 


dr dxJ(p c \ 

SW[J] 

SJ 


The functional T[</> c i] generates one-particle irreducible diagrams. 


(1.230) 


(1.231) 

(1.232) 


1.7 Quantization of a Dirac field 


The Lagrangian of the free Dirac field i jr is 

= (1.233) 


where 0 = y^ L 3^. In general A = y^A^. Variation with respect to i/f yields the 

Dirac equation 

(i0-m)iA = O. (1.234) 


The Dirac field, in canonical quantization, satisifes the anti-commutation 
relation 

{i/r(x 0 , x), yTx 0 , y)} = <5 (jc — y). (1.235) 

Accordingly, it is expressed as a Grassmann number function in path integrals. 
The generating functional is 


Zo[ri, t]] = 


J Vx/rVi// exp 


J dx (i/ r (i0 — m)i[r + i j/rj + rjifr) 


(1.236) 


where i]. rj are Grassmannian sources. 

The propagator is given by the functional derivative with respect to the 
sources, 


Six - y) 


_ <5 2 Zq[?7, rj\ 

8rj(x)8ri(y) 

1 f &' kx 

is? / d "' * =«+'»+ ie)a( * - y> 

(1.237) 


where A(x — y) is the scalar field propagator. 

By making use of the Dirac equations 

(i0 — m)i{r = —r) (i 0 + m) = rj 


(1.238) 



the generating functional is cast into the form 


Z 0 bh 'll = Z 0 [0, 0]exp 


-/ 


i / dx dy rj(x)S(x — y)rj(y) 


After Wick rotation r = i.x°, the normalization factor is obtained as 
Zq[0, 0] = Det(i0 — m) = ]” [ Xj 


(1.239) 


(1.240) 


where a, is the ; th eigenvalue of the Dirac operator ij — m . 


1.8 Gauge theories 

At present, physically sensible theories of fundamental interactions are based 
on gauge theories. The gauge principle — physics should not depend on how we 
describe it — is in harmony with the principle of general relativity. Here we give 
a brief summary of classical aspects of gauge theories. For further references, the 
reader should consult those books listed at the beginning of this chapter. 

1.8.1 Abelian gauge theories 

The reader should be familiar with Maxwell’s equations: 


div B = 0 

(1.241a) 

SB 

h curl E — 0 

dt 

(1.241b) 

div E — p 

(1.241c) 

dE 

curl E = — i . 

dt J 

(1.241d) 


The magnetic field B and the electric field E are expressed in terms of the vector 
potential A ^ — (0, A) as 

9 A 

B = curl A E = grad (f>. (1.242) 

dt 

Maxwell’s equations are invariant under the gauge transformation 


— >■ + d^x 


(1.243) 


where / is a scalar function. This invariance is manifest if we define the 

electromagnetic field tensor F /lv by 


IAV 


/ 

0 

~E X 

-Ey 

~Ez \ 


E x 

0 

B z 

-By 


Ey 

~B Z 

0 

B z 

V 

E z 

By 

— B x 

0/ 


(1.244) 



From the construction, F is invariant under (1.243). The Lagrangian of the 
electromagnetic fields is given by 

£em = -\F liV F ILV + Anj* (1.245) 

where j 11 = (p, j). 

Exercise 1.11. Show that (1.241a) and (1.241b) are written as 

3? F ^ + d„ F vH + d v F f „ = o (1 ,246a) 

while (1.241c) and (1.241d) are 

d v F' iv = j 11 (1.246b) 

where the raising and lowering of spacetime indices are carried out with the 
Minkowski metric p — diag(— 1, 1, 1, 1). Verify that (1.246b) is the Euler- 
Lagrange equation derived from (1.245). 

Let i fr be a Dirac field with electric charge e. The free Dirac Lagrangian 

£ 0 = iA(iK%+'»)iA (1-247) 

is clearly invariant under the global gauge transformation 

f -> e _ie “^ f -» iAe ira (1.248) 

where a e 1 is a constant. We elevate this symmetry to invariance under the local 
gauge transformation, 

f e ~ iea(x V f -» ire iea(x \ (1.249) 

The Lagrangian transforms under (1.249) as 

\jr(i + m)\ls — >■ i/r(i + ey^d^a + m)ir. (1.250) 

Since the extra term ed^a looks like a gauge transformation of the vector 
potential, we couple the gauge field A ^ with t// so that the Lagrangian has a local 
gauge symmetry. We find that 

C = Miy' i ’(d lt -ieA ll ) + mW (1.251) 

is invariant under the combined gauge transformation, 

ir -+ yfr' = e- ie “ ( *V f f' = f&' eaix) 

A[i > A ^ = A^ dftCi(x). 


(1.252) 



Let us introduce the covariant derivatives, 

= d fl - \eA lt V; = d„ - i eA[ L . (1.253) 

The reader should verify that V ;/ \jj transforms in a nice way, 

V^' = e~ iea(x) y^. (1.254) 

The total quantum electrodynamic (QED) Lagrangian is 

£qed = + rn)f. (1.255) 

Exercise 1.12. Let (p — (<p\ + i(p2)/V2 be a complex scalar field with electric 
charge e. Show that the Lagrangian 

£ = ^(V^O?^) + mV* d-256) 

is invariant under the gauge transformation 

A#t _ 3#ta(jc) . (1.257) 

1.8.2 Non-Abelian gauge theories 

The gauge transformation just described is a member of a U(l) group, that 
is a complex number of modulus 1, which happens to be an Abelian group. 
A few decades ago, Yang and Mills (1954) introduced non- Abelian gauge 
transformations. At that time, non-Abelian gauge theories were studied from 
curiosity. Nowadays, they play a central role in elementary particle physics. 

Let G be a compact semi-simple Lie group such as SO (AO or SLJ(A). The 
anti-Hermitian generators { T a } satisfy the commutation relations 

[T a , Tp\ = f a p y Ty (1.258) 

where the numbers f a p Y are called the structure constants of G. An element U 
of G near the unit element can be expressed as 

U = exp(-0 a T a ). (1.259) 

We suppose a Dirac field \[r transforms under f/e Gas 

i/r — > Ux[r x/r —> \[rU^ . (1.260) 

[Remark: Strictly speaking, we have to specify the representation of G to which 
\[/ belongs. If readers feel uneasy about (1.260), they may consider i fr is in the 
fundamental representation, for example.] 

Consider the Lagrangian 


£ = fliy^idu + gAf,) + m]f 


( 1 . 261 ) 



where the Yang-Mills gauge field takes its values in the Lie algebra of G, that 
is, A,j, can be expanded in terms of T a as A ia = ,4 “ T a . (Script fields are anti- 
Hermitian.) The constant g is the coupling constant which controls the strength 
of the coupling between the Dirac field and the gauge field. It is easily verified 
that £ is invariant under 

\[r \j/' — Ui]t l/r —> = {/, rU 1 

(1.262) 

Aft -* = UA„U^ + g-'Ud^U t. 

The covariant derivative is defined by = 3^ + gA/, as before. The covariant 
derivative V /t ijj transforms covariantly under the gauge transformation 

(1.263) 

The Yang-Mills field tensor is 

9 " iiv = dnA v — d v A/j, + glA^, A v ]. (1.264) 

The component F flv a is 

V = 3 - d v A„ a + gffiy a A^A v Y. (1.265) 

If we define the dual field tensor *T /iv , = jS in , K ,A K> \ it satisfies the Bianchi 
identity, 

v = 3 /t * ^ lv + glAp, = 0. (1.266) 

Exercise 1.13. Show that T /iv transforms under (1.262) as 

9> -> U?„ V UK (1.267) 

From this exercise, we find a gauge-invariant action 

£ym = — ^tr(3^ l ’3» (1.268a) 

where the trace is over the group matrix. The component form is 

£ym = ~\F^ va F,J tr {T a T p ) = \F' lva F, lva (1.268b) 

where we have normalized {T a } so that Vc(T a T p ) = The field equation 

derived from (1.268) is 


F>n^fi v — 3 i_l“ 5 jJLV + g[A/!, — 0 . 


(1.269) 



1.8.3 Higgs fields 


If the gauge symmetry is manifest in our world, there would be many observable 
massless vector fields. The absence of such fields, except for the electromagnetic 
field, forces us to break the gauge symmetry. The theory is left renormalizable if 
the symmetry is broken spontaneously. 

Let us consider a U(l) gauge field coupled to a complex scalar field 0, whose 
Lagrangian is given by 

£ = + (^0)^0) - k(0 f 0 - v 2 ) 2 . (1.270) 

The potential V(0) = A(0'0 — v 2 ) 2 has minima V — 0 at |0| — v. The 
Lagrangian (1.270) is invariant under the local gauge transformation 

A^A^-d^a 0-> e - ira 0 0 f e ira 0 f . (1.271) 


This symmetry is spontaneously broken due to the vacuum expectation value 
(VEV) (0) of the Higgs field 0. We expand 0 as 

0 = -Wit + p{x)]t la(x),v \=[v + p(x) + ia(x)] 

V2 V2 

assuming v 0. If v ^ 0, we may take the unitary gauge in which the phase of 
0 is ‘gauged away’ so that 0 has only the real part, 


1 

0 (x) = — F (t> + p(x)). 

>/2 

If we substitute (1.272) into (1.270) and expand in p, we have 

c = - \F^F^ V + fapd^p + W-A^iv 2 + 2 vp + p 2 ) 
- \X(Av 2 p 2 + 4 up 3 + p 4 ). 

The equations of motion for A lt and p derived from the free parts are 


(1.272) 


(1.273) 


3 V F V/1 + 2 e~v~An = 0 d^d^p + 2Xi rp = 0. 


(1.274) 


From the first equation, we find A tl must satisfy the Lorentz condition d fl A ,x = 0. 
The apparent degrees of freedom of (1.270) are 2(photon) + 2(complex scalar) = 
4. If VEV ^4 0, we have 3 (massive vector) + l(real scalar) = 4. The field Ao has 
a mass term with the wrong sign and so cannot be a physical degree of freedom. 
The creation of massive fields out of a gauge field is called the Higgs mechanism.. 


1.9 Magnetic monopoles 

Maxwell’s equations unify electricity and magnetism. In the history of physics 
they should be recognized as the first attempt to unify forces in Nature, In spite 
of their great success, Dirac (1931) noticed that there existed an asymmetry in 
Maxwell’s equations: the equation div B — 0 denies the existence of magnetic 
charges. He introduced the magnetic monopole, a point magnetic charge, to make 
the theory symmetric. 



1.9.1 Dirac monopole 

Consider a monopole of strength g sitting at r — 0, 

div B — Art gif’ (r). (1.275) 

It follows from A(l/r) = — 47T(5 3 (r) and V(l/r) = —r/r 3 that the solution of 
this equation is 

B=gr/r 3 . (1.276) 

The magnetic flux <l> is obtained by integrating B over a sphere S of radius R so 
that 

<t> = <j> B ■ dS = 4ng. (1.277) 

What about the vector potential which gives the monopole field (1.276)? If 
we define the vector potential A N by 


A N V = 


-gy 
r(r + z ) 


4 n - 

A y — 


r(r + z ) 


A n , = 0 


we easily verify that 


curlA N = gr/r 3 


■ 4jtgS(x)S(y)6(-z). 


(1.278a) 

(1.279) 


We have curl A N = B except along the negative ’-axis ( 9 = tc). The singularity 
along the ’-axis is called the Dirac string and reflects the poor choice of the 
coordinate system. If, instead, we define another vector potential 


A S v = 


gy 

r(r - z) 


A s - 

A y — 


r{r — z) 


A z = 0 


(1.278b) 


we have curlA s = B except along the positive z-axis ( 9 = 0) this time. The 
existence of a singularity is a natural consequence of (1.277). If there were a 
vector A such that B — curl A with no singularity, we would have, from Gauss’ 
law, 

<& = <f) B ■ dS = (D curl A ■ dS — div(curl A) dV — 0 

Js Js Jv 

where V is the volume inside the surface S. This problem is avoided only when 
we abandon the use of a single vector potential. 


Exercise 1.14. Let us introduce the polar coordinates (r, 6, (p). Show that the 
vector potentials A N and A s are expressed as 


■ N/ \ g(l-COS0), 

A (r) = — 

r sin 6 

,S/ x g(l + cos0), 

A (r) = — ej, 

r sin 9 


(1.280a) 

(1.280b) 


where — — sin (f> e x + cos cp e y 



1.9.2 The Wu-Yang monopole 

Wu and Yang (1975) noticed that the geometrical and topological structures 
behind the Dirac monopole are best described by fibre bundles. In chapters 9 
and 10, we give an account of the Dirac monopole in terms of fibre bundles and 
their connections. Here we outline the idea of Wu and Yang without introducing 
the fibre bundle. Wu and Yang noted that we may employ more than one vector 
potential to describe a monopole. For example, we may avoid singularities if 
we adopt A N in the northern hemisphere and A s in the southern hemisphere 
of the sphere S surrounding the monopole. These vector potentials yield the 
magnetic field B = gr / r 3 , which is non-singular everywhere on the sphere. 
On the equator of the sphere, which is the boundary between the northern and 
southern hemispheres, A N and A s are related by the gauge transformation, 
A n — A s = grad A. To compute this quantity A, we employ the result of exercise 
1.14, 

A N - A s = -^-e# = grad(2 gcf>) (1.281) 

r sm0 

where use has been made of the expression 


3 / „ 1 3 / „ 

grad / = y- e r + ~^e e 
or r SO 


1 df . 
r sin0 30 ^ 


Accordingly, the gauge transformation function connecting A N and A s is 


A = 2 g<p. (1.282) 

Note that A is ill defined at 6 — 0 and 0 — it. Since we perform the gauge 
transformation only at 0 = it /2, these singularities do not show up in our analysis. 
The total flux is 

<t> = * curl A ■ dS = j curl A N ■ dS + j curlA s -dS (1.283) 

Js Ju N Ju s 

where 6/- and Us stand for the northern and southern hemispheres respectively. 
Stokes’ theorem yields 

3> = <b A n • ds — ® A s • ds = (f (A N - A s ) • ds 
J equator J equator J equator 

= ® grad(2g0) • ds = Agn (1.284) 

J equator 

in agreement with (1.277). 


1.9.3 Charge quantization 

Consider a point particle with electric charge e and mass m moving in the field 
of a magnetic monopole of charge g. If the monopole is heavy enough, the 


Schrodinger equation of the particle takes the form 


(^p A^j i/f(r) = E\j/{r). 


(1.285) 


It is easy to show that under the gauge transformation A — > A + grad A, the 
wavefunction changes as i/ r — »■ exp(icA /hc)x//. In the present case, A N and A s 
differ only by the gauge transformation A N — A s = grad(2 g<j>). If i/r N and i /r s are 
wavefunctions dehned on Us and Us respectively, they are related by the phase 
change 

. « . . ( ieA\ . \ . 


i fr (r) — exp 


r'(r). 


(1.286) 


Let us take 9 — n/2 and study the behaviour of wavefunctions as we go round 
the equator of the sphere from (p — Qio <p — lit . The wavefunction is required to 
be single valued, hence (1.286) forces us to take 


n e Z, 


(1.287) 


This is the celebrated Dirac quantization condition for the magnetic charge; if 
the magnetic monopole exists, the magnetic charge takes discrete values, 


n e Z. 


(1.288) 


By the same token, if there exists a magnetic monopole somewhere in the 
universe, all the electric charges are quantized. 

1.10 Instantons 

The vacuum-to-vacuum amplitude in the Euclidean theory is 


Z = (0|0) oc J V(j) e~ s[cl> ’ d ^ ] 


(1.289) 


where S is the Euclidean action. Equation (1.289) shows that the principal 
contribution to Z comes from the values of <fi{x) which give the local minima 
of S[(j), ((, </)]. In many theories there exist a number of local minima in addition 
to the absolute minimum. In the case of non- Abelian gauge theories these minima 
are called instantons. 


1.10.1 Introduction 

Let us consider the SU(2) gauge theory defined in the four-dimensional Euclidean 
space IR 4 . The action is 

S = J d 4 xC(x) = J d 4 x[-|trT /iP TH (1.290) 



where the field strength is 


with 


The field equation is 


3"/iv — 9/i-^v 9 y yi n + g[A./i, A v ] 


= A “ qr _ r ff 

^ J nv — r /tv • 


T^H^HV — dfi^/xv + 3>v] — 0. 


(1.291) 


(1.292) 


In the path integral only those field configurations with finite action 
contribute. Suppose A fl satisfies 

Ap —> if/(x) _1 9 M t/(x) as |x| -> oo (1.293) 

where {/ (x) is an element of SU(2). We easily find that T /(y vanishes for the ,A /X 
of (1.293). We require that on sphere S' 3 of large radius, the gauge potential be 
given by (1.293). 

Later we show that this configuration is characterized by the way in which 
S 3 is mapped to the gauge group SU(2). Non-trivial configurations are those that 
cannot be deformed continuously to a uniform configuration. They were proposed 
by Belavin et al (1975) and are called instantons. 


1.10.2 The (anti-)self-dual solution 

In general, solving a second-order differential equation is more difficult than 
solving a first-order one. It is nice if a second-order differential equation can 
be replaced by a first-order one which is equivalent to the original problem. Let 
us consider the inequality 

I d 4 x tr (3> ± *3» 2 > 0. (1.294) 

Clearly ( 1 .294) is saturated if 

3> = ± * 3>. (1.295) 

If the positive sign is chosen, T is said to be self-dual while the negative sign 
gives an anti-self-dual solution. If (1.295) is satisfied, the field equation is 
automatically satisfied since 

'D /( T /XV = ±T>n * = 0 (Bianchi identity). (1.296) 

As we will show in section 10.5, the integral 

Q= / d4 - x trT /X y*T" v 


(1.297) 



is an integer characterizing the way S 3 is mapped to SU(2). If 9 is self-dual then 
Q is positive, and if 3“ is anti-self-dual then Q is negative. From (1.294), we find 
(note that *9 /xu * 9 / ' 1 ' = 9 / - t ,>9 MV ) that 

J d 4 x (29 /xu 9 ,il ' ±2* 9 /xlJ * 9 /iV ) > 0. (1.298) 

From this inequality and the definition of the action, we find that 

S>8tt 2 \Q\ (1.299) 

where the inequality is saturated for (1.295). Let us concentrate on the self-dual 
solution 9 = *9. We look for an instanton solution of the form 

A, x = \f(r)U{x)- u d lx U(x) (1.300) 

where r = |x| and 



(1.306) 



Problems 


1.1 Consider a Hamiltonian of the form 

>(^) 2 + ‘ <v ^ + v W 

where V (<p) (> 0) is a potential. If (p is a time-independent classical solution, we 
may drop the first term and write H[(p ] = H\[<p] + H 2 [</>], where 

\ J d" x (V0) 2 W 2 [</>] ee J d n x 

(1) Consider a scale transformation cp(x) -> (p(Xx). Show that //,[</>] 
transforms as 

Hm -> #,'[</>] = l ( "- 2 > Hm H 2 [4>] -> = X~ n H 2 [<P]. 

(2) Suppose (f> satisfies the field equation. Show that 

(2 - n)H\[(j)} - nH 2 [4>] = 0. 



[Hint: Take the /.-derivative of H^[<f>] + H^lip] and put 1=1.] 

(3) Show that time-independent topological excitations of H[(p] exist if and 
only if n = 1 (Derrick’s theorem). Consider ways out of this restriction. 



2 


MATHEMATICAL PRELIMINARIES 


In the present chapter we introduce elementary concepts in the theory of maps, 
vector spaces and topology. A modest knowledge of undergraduate mathematics, 
such as set theory, calculus, complex analysis and linear algebra is assumed. 

The main purpose of this book is to study the application of the theory of 
manifolds to the problems in physics. Vector spaces and topology are, in a sense, 
two extreme viewpoints of manifolds. A manifold is a space which locally looks 
like K" (or C" ) but not necessarily globally. As a first approximation, we may 
model a small part of a manifold by a Euclidean space M" (or C") (a small 
area around a point on a surface can be approximated by the tangent plane at 
that point); this is the viewpoint of a vector space. In topology, however, we 
study the manifold as a whole. We want to study the properties of manifolds and 
classify manifolds using some sort of ‘measures’. Topology usually comes with 
an adjective: algebraic topology, differential topology, combinatorial topology, 
general topology and so on. These adjectives refer to the measure we use when 
classifying manifolds. 

2.1 Maps 

2.1.1 Definitions 

Let X and Y be sets. A map (or mapping) / is a rule by which we assign y e Y 
for each x e X. We write 

f :X —*■ Y. (2.1) 

If / is defined by some explicit formula, we may write 

/ : x f(x ) (2.2) 

There may be more than two elements in X that correspond to the same y e Y . A 
subset of X whose elements are mapped to y e Y under / is called the inverse 
image of y, denoted by / _1 (y) = {.r e X\f(x) = y). The set X is called 
the domain of the map while Y is called the range of the map. The image of 
the map is f(X) — {y e T|y = fix) for some x e X} C Y. The image 
fiX) is also denoted by im /. The reader should note that a map cannot be 
defined without specifying the domain and the range. Take fix ) = expx, for 
example. If both the domain and the range are ffi, fix) = — 1 has no inverse 



image. If. however, the domain and the range are the complex plane C, we find 
/ -1 (— 1) = {(2 n+ 1 )jt\\h e Z}. The domain X and the range Y are as important 
as / itself in specifying a map. 

Example 2.1. Let / : E — »■ I be given by fix) = sin.r. We also write 
/:.m sinx. The domain and the range are E and the image /(E) is [—1, 1]. 
The inverse image of 0 is / -l (0) = [nn\n e Zj. Let us take the same function 
fix) — sinx = (e LV — e _i *)/2i but / : C — »■ C this time. The image /(C) is the 
whole complex plane C. 

Definition 2.1. If a map satisfies a certain condition it bears a special name. 

(a) A map / : X —*■ Y is called injective (or one to one) if x / x' implies 
fix) / fix'). 

(b) A map / : X — »■ Y is called surjective (or onto) if for each y e Y there 
exists at least one element x e X such that fix) = y. 

(c) A map / : X — »■ Y is called bijective if it is both injective and surjective. 

Example 2.2. A map / : E —*■ E defined by / : x i->- ax (a e E — {0}) is 
bijective. / : E — ► E defined by /:.m x 2 is neither injective nor surjective. 
/ : E — »■ E given by / : x i->- exp x is injective but not surjective. 

Exercise 2.1. A map / : E -> E defined by / : x i— > sinx is neither injective 
nor surjective. Restrict the domain and the range to make / bijective. 

Example 2.3. Let M be an element of the general linear group GL(n, E) whose 
matrix representation is given by n x n matrices with non-vanishing determinant. 
Then M : E" —*■ E" , xi-> Mx is bijective. If det M = 0, it is neither injective 
nor surjective. 

A constant map c : X — »■ Y is defined by c(x) = vo where yo is a fixed 
element in Y and x is an arbitrary element in X. Given a map / : X —> Y , we 
may think of its restriction to A C X, which is denoted as /\a : A —*■ Y. Given 
two maps / : X — > Y and g : Y — »■ Z, the composite map of / and g is a map 
g o f : X Z defined by g o /(x) = gifix)). A diagram of maps is called 
commutative if any composite maps between a pair of sets do not depend on how 
they are composed. For example, in figure 2.1, / o g = h o j and / o g — k etc. 

Exercise 2.2. Let / : E E be defined by / : x — > x 1 and g : E — > E by 
g : x — > exp x . What are g o / : E — ► E and / o g : E — » E? 

If A C A, an inclusion map i : A X is defined by i(a) — a for any 
a e A. An inclusion map is often written as i \ A X . The identity map 
idx : A — » A is a special case of an inclusion map, for which A = X. If 
/ : X — > Y defined by / : x h> fix) is bijective, there exists an inverse map 
/ -1 : Y — > X, such that / - : fix) x, which is also bijective. The maps / 


8 


Y 


X 



Figure 2.1. A commutative diagram of maps. 

and f~ l satisfy / o f~ l = idy and / _1 of = idy. Conversely, if / : X — > Y 
and g : Y — > X satisfy / o g = idy and go f — idy, then / and g are bijections. 
This can be proved from the following exercise. 

Exercise 2.3. Show that if / : X — > Y and g : Y — >■ X satisfy g o f = idy, / is 
injective and g is surjective. If this is applied to / o g — idy as well, we obtain 
the previous result. 


Example 2.4. Let / : R — * (0, oo) be a bijection defined by y : jc i — >• expx. 
Then the inverse map /“* : (0, oo) — »■ ffi is / _l : x i->- lnx. Let g : 
(— tt/2, tt/2) — »■ (—1, 1) be a bijection defined by g : x —*■ sinx. The inverse 
map is g _l :x i-> sin -1 x. 

Exercise 2.4. The n-dimensional Euclidean group E" is made of an n- 
dimensional translation a : x — »• x + a (x, a e ffi" ) and an O(n) rotation R : x -> 
Rx, R e O(n). A general element (R, a) of E n acts onxby(K,a) : x !->■ Rx+a. 
The product is defined by ( R 2 , 02 ) x ( R\ , a 1 ) : x i-> R 2 (R\x + «i) + aj, that 
is, (R 2 , 02 ) o (/?i , at) = (R 2 R 1 , + ai)- Show that the maps a. R and ( R , a) 

are bijections. Find their inverse maps. 

Suppose certain algebraic structures (product or addition, say) are endowed 
with the sets X and Y. If / : X — > Y preserves these algebraic structures, then / 
is called a homomorphism. For example, let X be endowed with a product. If / 
is a homomorphism, it preserves the product, f(ab ) = f(a)f(b). Note that ab is 
defined by the product rule in X, and f(a ) f(b) by that in Y . If a homomorphism 
/ is bijective, / is called an isomorphism and X is said to be isomorphic to Y, 
denoted x = y. 


2.1.2 Equivalence relation and equivalence class 

Some of the most important concepts in mathematics are equivalence relations 
and equivalence classes. Although these subjects are not directly related to maps, 
it is appropriate to define them at this point before we proceed further. A relation 
R defined in a set X is a subset of X 2 . If a point (a, b) e X 2 is in R, we may write 
aRb. For example, the relation > is a subset of K 2 . If (a, b) e >, then a > b. 

Definition 2.2. An equivalence relation ~ is a relation which satisfies the 
following requirements: 

(i) a ~ a (reflective). 

(ii) If a ~ b, then b ~ a (symmetric). 

(iii) If a ~ b and b ~ c, then a ~ c (transitive). 

Exercise 2.5. If an integer is divided by 2, the remainder is either 0 or 1. If two 
integers n and in yield the same remainder, we write m ~ n. Show that ~ is an 
equivalence relation in Z . 

Given a set X and an equivalence relation ~, we have a partition of X into 
mutually disjoint subsets called equivalence classes. A class [a] is made of all 
the elements x in X such that x ~ a, 

[a] = {x e X\x ~ a} (2.3) 

[a] cannot be empty since a ~ a. We now prove that if [a] n [b\ ^ 0 then 
[a] = [b\. First note that a ~ b. (Since [a] fl [b] ^ 0 there is at least one 
element in [a] fl [b] that satisfies c ~ a and c — b. From the transitivity, we 
have a ~ b.) Next we show that [a] C [/>]. Take an arbitrary element a' in [a]; 
a' ~ a. Then a ~ b implies b ~ a', that is a' e [b]. Thus, we have [a] C [Z>]. 
Similarly, [a] D [Z>] can be shown and it follows that [a] = [b]. Hence, two 
classes [a] and [Z>] satisfy either [a] = [Z>] or [a] fl [Z>] = 0. In this way a set X 
is decomposed into mutually disjoint equivalence classes. The set of all classes 
is called the quotient space, denoted by X/ ~. The element a (or any element 
in [a]) is called the representative of a class [a]. In exercise 2.5, the equivalence 
relation ~ divides integers into two classes, even integers and odd integers. We 
may choose the representative of the even class to be 0, and that of the odd class 
to be 1 . We write this quotient space Z/ ~. Z/ ~ is isomorphic to Z 2 , the cyclic 
group of order 2, whose algebra is defined by 0 + 0 = 0, 0+1 = 1+0 = 1 
and 1 + 1 = 0. If all integers are divided into equivalence classes according to 
the remainder of division by n, the quotient space is isomorphic to Z„, the cyclic 
group of order n . 

Let X be a space in our usual sense. (To be more precise, we need the 
notion of topological space, which will be defined in section 2.3. For the time 
being we depend on our intuitive notion of ‘space’.) Then quotient spaces may 
be realized as geometrical figures. For example, let x and y be two points in ffi. 



(a) 


x - 2ir x 
— I — • t — • 

~2tt 0 


x + 2t; 

H * I— 

2 7! 4 7T 


(b) 


x 


0 



Figure 2.2. In («) all the points x + 2nn, n e Z are in the same equivalence class [x]. We 
may take x e [0, 2n) as a representative of [x]. (b) The quotient space R/ ~ is the circle 
S 1 . 


Introduce a relation ~ by: x ~ y if there exists n e TL such that y = x + Ijrn. 
It is easily shown that ~ is an equivalence relation. The class [x] is the set 
{. . . , x — h r, x, x + 2tc , . . .}. A number x e [0, 2tc) serves as a representative of 
an equivalence class [x], see figure 2.2(a). Note that 0 and 2jt are different points 
in M but, according to the equivalence relation, these points are looked upon as 
the same element in 11/ ~. We arrive at the conclusion that the quotient space 
M/ ~ is the circle S 1 = {e lS |0 < 9 < 2: r}; see figure 2.2(b). Note that a point 
s is close to a point 2n — s for infinitesimal s. Certainly this is the case for S 1 , 
where an angle e is close to an angle 2 tx — s, but not the case for M. The concept 
of closeness of points is one of the main ingredients of topology. 

Example 2.5. (a) Let A be a square disc {(x, y) e M 2 | |x| > 1, |y| > 1}. If we 
identify the points on a pair of facing edges, (— 1, y) ~ (1, y), for example, we 
obtain the cylinder, see figure 2.3(a). If we identify the points (—1, —y) ~ (1, y), 
we find the Mobius strip, see figure 2.3 (b). [Remarks: If readers are not familiar 
with the Mobius strip, they may take a strip of paper and glue up its ends after 
a 7r -twist. Because of the twist, one side of the strip has been joined to the 
other side, making the surface single sided. The Mobius strip is an example 
of a non-orientable surface, while the cylinder has definite sides and is said to 
be orientable. Orientability will be discussed in terms of differential forms in 
section 5.5.] 

(b) Let (xi, y i ) and (x 2 , yi) be two points in M 2 and introduce an equivalence 
relation ~ by: (xi,yi) ~ (x 2 , yi) if X 2 = xq + 2 nn x and y 2 = yi + 2nn y , 
n x ,n y e Z. Then ~ is an equivalence relation. The quotient space M 2 / ~ is 
the torus T 2 (the surface of a doughnut), see figure 2.4(a). Alternatively, T 2 is 
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cylinder. ( b ) If the edges are identified in the opposite direction, we have a Mobius strip. 
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Figure 2.4. If all the points (x + 2nn x , y + 2nn y ), n x , n y e Z are identified as in (a), 
the quotient space is taken to be the shaded area whose edges are identified as in ( b ). This 
resulting quotient space is the torus T 2 . 


represented by a rectangle whose edges are identified as in figure 2A{b). 

(c) What if we identify the edges of a rectangle in other ways? Figure 2.5 
gives possible identifications. The spaces obtained by these identifications are 



Figure 2.5. The Klein bottle (a) and the projective plane ( b ). 


called the Klein bottle, figure 2.5(a), and the projective plane, figure 2.5(h), 
neither of which can be realized (or embedded) in the Euclidean space JR 3 without 
intersecting with itself. They are known to be non-orientable. 

The projective plane, which we denote RP 2 , is visualized as follows. Let us 
consider a unit vector n and identify n with —n, see figure 2.6. This identification 
takes place when we describe a rod with no head or tail, for example. We are 
tempted to assign a point on S 2 to specify the ‘vector’ n. This works except for 
one point. Two antipodal points n = (0, <p) and — n — (n — 9, 7t + <f>) represent 
the same state. Then we may take a northern hemisphere as the coset space S 2 / ~ 
since only a half of S 2 is required. However, the coset space is not just an ordinary 
hemisphere since the antipodal points on the equator are identified. By continuous 
deformation of this hemisphere into a square, we obtain the square in figure 2.5(b). 

(d) Let us identify pairs of edges of the octagon shown in figure 2.1(a). The 
quotient space is the torus with two handles, denoted by E 2 , see figure 2.1(b). 
Eg, the torus with g handles, can be obtained by a similar identification, see 
problem 2.1. The integer g is called the genus of the torus. 

(e) Let D 2 = {(x, y) e M 2 |x 2 + y 2 < 1} be a closed disc. Identify the 
points on the boundary {(x,y) e IR 2 |x 2 + y 2 = 1}; (xi,yi) ~ (x 2 ,yi) if 
x 2 + y 2 = x 2 + y 2 = 1. Then we obtain the sphere S 2 as the quotient space 
D 2 / ~, also written as D 2 /S l , see figure 2.8. If we take an ^-dimensional disc 

D n — {(xo, . . . , x") e M" +1 1 (xo) 2 H + (x") 2 < 1} and identify the points on 

the surface S" -1 , we obtain the n -sphere S n , namely D n /S n ~ l = S" . 


Exercise 2.6. Let H be the upper-half complex plane {r e C| Im r > 0}. Define a 



Figure 2.6. If n has no head or tail, one cannot distinguish n from — n and they must 
be identified. One obtains the projective plane RP- by this identification n ~ — n; 
RP~ — S 2 / It suffices to take a hemisphere to describe the coset space. Note, however, 
that the antipodal points on the equator are identified. 


(a) 



a 2 

Figure 2.7. If the edges of (a) are identified a torus with two holes (genus two) is obtained. 




Figure 2.8. A disc D 2 whose boundary S 1 is identified is the sphere S 2 . 


group 
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a, b, c, d e Z, ad — be = 



SL(2, Z) = 


(2.4) 



Introduce a relation for r, x' e H, by r ~ x' if there exists a matrix 

Me SL(2, Z) 
c a ) 

such that 

= (ax + b)/(cr + d). (2.5) 

Show that this is an equivalence relation. (The quotient space H/SL(2, Z) is 
shown in figure 8.3.) 

Example 2.6. Let G be a group and H a subgroup of G. Let g, g' e G and 
introduce an equivalence relation ~ by g ~ g' if there exists h e H such that 
g' = gh. We denote the equivalence class [g] = {gh\h e H] by gH . The class 
gH is called a (left) coset. gH satisfies either gH (T g'H = 0 or gH = g'H . 
The quotient space is denoted by G/H . In general G/H is not a group unless H 
is a normal subgroup of G . that is, ghg~ x e H for any g e G and h e H . If 
H is a normal subgroup of G, G/H is called the quotient group, whose group 
operation is given by [g] * [g'] = (gg'], where * is the product in G/H . Take 
ghe[g] and g'h'e[g']. Then there exists Well such that hg' = g'h" and hence 
ghg'h ' = gg'h" h' e[gg'\. The unit element of G/H is the equivalence class [e] 
and the inverse element of [g] is [g -1 ]. 

Exercise 2.7. Let G be a group. Two elements a, b e G are said to be conjugate 
to each other, denoted by a ~ b, if there exists g e G such that b — gag~ l . Show 
that ~ is an equivalence relation. The equivalence class [a] = {gag~ l |g e G } is 
called the conjugacy class. 

2.2 Vector spaces 

2.2.1 Vectors and vector spaces 

A vector space (or a linear space) V over a field K is a set in which two 
operations, addition and multiplication by an element of K (called a scalar), are 
defined. (In this book we are mainly interested in K — ffi and C.) The elements 
(called vectors) of V satisfy the following axioms: 

(i) u + v = v + u . 

(ii) (« + ») + w = u + (u + w). 

(iii) There exists a zero vector 0 such that v + 0 = v. 

(iv) For any u, there exists — u, such that u + (— m) = 0. 

(v) c(u + v) — cu + cv. 

(vi) (c + d)u — cu + du. 

(vii) (cd)u = c(du). 

(viii) 1 m = u. 

Here u, v, w e V and c, d e K and 1 is the unit element of K. 



Let { 17 / } be a set of k (>0) vectors. If the equation 

X\V\ + X2V2 H h XkVk — 0 


( 2 . 6 ) 


has a non-trivial solution, x, ^ 0 for some i, the set of vectors {«/} is called 
linearly dependent, while if (2.6) has only a trivial solution, x,- = 0 for any i , 
{ v , } is said to be linearly independent. If at least one of the vectors is a zero 
vector 0, the set is always linearly dependent. 

A set of linearly independent vectors {<?, ) is called a basis of V , if any 
element v e V is written uniquely as a linear combination of {c,}: 

v = v 1 ei + v 2 e 2 b v n e n . (2.7) 

The numbers v' e K are called the components of v with respect to the basis 
{e j}. If there are n elements in the basis, the dimension of V is n, denoted by 
dim V = n. We usually write the n -dimensional vector space over K as V (n. K) 
(or simply V if n and K are understood from the context). We assume n is finite, 

2.2.2 Linear maps, images and kernels 

Given two vector spaces V and W, a map / : V — > W is called a linear map 
if it satisfies f(ci\V\ + ciiV 2 ) = a\f{v\) + ajfivf) for any ci\,a 2 e K and 
®l, V 2 e V. A linear map is an example of a homomorphism that preserves the 
vector addition and the scalar multiplication. The image of / is f (V ) C W and 
the kernel of / is {u e V\ f(v) — 0} and denoted by im / and ker / respectively, 
ker / cannot be empty since /( 0) is always 0. If W is the field K itself, / is 
called a linear function. If / is an isomorphism, V is said to be isomorphic to 
W and vice versa, denoted by V = VV'. It then follows that dim V — dim VV. 
In fact, all the n -dimensional vector spaces are isomorphic to K n , and they are 
regarded as identical vector spaces. The isomorphism between the vector spaces 
is an element of GL(«, K). 

Theorem 2.1. If / : V —>■ W is a linear map, then 

dim V = dim (ker/) + dim(im/). (2.8) 

Proof. Since / is a linear map, it follows that ker / and im / are vector spaces, 
see exercise 2.8. Let the basis of ker / be {gq, . . . , g,.} and that of im/ be 
{h \, .... h \ }. For each i (I < i < s), take h, e V such that fill,) — //' and 
consider the set of vectors {gj, .... g ( ., hi, ... , h s }. 

Now we show that these vectors form a linearly independent basis of V . 
Take an arbitrary vector v e V . Since /( i>) e im /, it can be expanded as f(v) — 
c'h'i — c l f(hj). From the linearity of /, it then follows that /(v — c'hj) = 0, that 
is v — c'hj e ker /. This shows that an arbitrary vector v is a linear combination 
of {gj, .... g,., hi, ... , h s }. Thus, V is spanned by r + s vectors. Next let us 



assume a' g, + b'hj — 0. Then 0 = /( 0) = fia'gj + b l h \ ) = b' f{hi) = b'h 
which implies that b‘ — 0. Then it follows from a' g, = 0 that a' = 0, and 
the set {gq, . . . , g r . hi, . . h s } is linearly independent in V. Finally we find 
dim V = r + s = dimfker /) + dim(im /). □ 

[ Remark : The vector space spanned by {hi, , h s ] is called the orthogonal 
complement of ker / and is denoted by (ker f ) 1 .] 

Exercise 2.8. (1) Let / : V -> W be a linear map. Show that both ker / and im / 
are vector spaces. 

(2) Show that a linear map / : V -> V is an isomorphism if and only if 
ker / = { 0 }. 

2.2.3 Dual vector space 

The dual vector space has already been introduced in section 1.2 in the context of 
quantum mechanics. The exposition here is more mathematical and complements 
the materials presented there. 

Let / : V —> K be a linear function on a vector space V (n. K) over a 
field K. Let {c, | be a basis and take an arbitrary vector v = v l ei + ■ ■ ■ + v"e n . 
From the linearity of /, we have f(v) — f{e\) + ■ ■ ■ + v n f(e n ). Thus, if we 

know f(ei) for all i, we know the result of the operation of / on any vector. It is 
remarkable that the set of linear functions is made into a vector space, namely a 
linear combination of two linear functions is also a linear function. 

(a\f\ + a 2 fi){v) = aifi(v) + a 2 f 2 (v) (2.9) 

This linear space is called the dual vector space to V (n, K) and is denoted by 

V*(n, K ) or simply by V*. If dim V is finite, dim V* is equal to dim V . Let 
us introduce a basis {e* 1 } of V*. Since e* 1 is a linear function it is completely 
specified by giving e*' (ej) for all j . Let us choose the dual basis, 

c*''(e y ) = 5;.. (2.10) 

Any linear function /, called a dual vector in this context, is expanded in terms 
of {e*'{, 

f=fie*‘. (2.11) 

The action of / on v is interpreted as an inner product between a column vector 
and a row vector. 


/ (v) - fte*' (v’ e j) = fivU*\ej) = 0. (2.12) 

We sometimes use the notation ( , ) : V * x V — > K to denote the inner product. 

Let V and W be vector spaces with a linear map / : V — > W and let 
g : W -> K be a linear function on W ( g e W*). It is easy to see that the 
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Figure 2.9. The pullback of a function g is a function f*(g) = g o f . 


composite map g o f is a linear function on V. Thus, / and g give rise to an 
element h e V* defined by 

h(v) = g{f( v)) v e V. (2.13) 

Given g e W*, a map / : V — »■ W has induced a map h e V*. Accordingly, 
we have an induced map f* : W* -> V* defined by /* : g i->- h — f*(g ), see 
figure 2.9. The map h is called the pullback of g by /*. 

Since dim V* — dim V, there exists an isomorphism between V and V*. 
However, this isomorphism is not canonical; we have to specify an inner product 
in V to define an isomorphism between V and V* and vice versa, see the next 
section. The equivalence of a vector space and its dual vector space will appear 
recurrently in due course. 

Exercise 2.9. Suppose {/ ; -} is another basis of V and {/*'} the dual basis. In 
terms of the old basis, /, is written as /, = A , 7 e j where A e GL (n, K). Show 
that the dual bases are related by e*‘ = f * J A . 

2.2.4 Inner product and adjoint 

Let V = V (m, K) be a vector space with a basis {c, } and let g be a vector space 
isomorphism g : V —*■ V* , where g is an arbitrary element of GL (m, K). The 
component representation of g is 

g : v j gij v j . (2.14) 

Once this isomorphism is given, we may define the inner product of two vectors 
»i, V 2 e V by 

g(vi,v 2 ) = (gvi,v 2 ). (2.15) 

Let us assume that the field A" is a real number EL for definiteness. Then 
equation (2.15) has a component expression. 


g(Vl,V2) = VlgjiV 2 J - 


( 2 . 16 ) 


We require that the matrix (gjj) be positive definite so that the inner product 
g(v, v) has the meaning of the squared norm of v. We also require that the metric 
be symmetric: g,j = gjj so that g(vu «2) = g(v 2 , Vi). 

Next, let W — W(n, M) be a vector space with a basis {/„} and a vector 
space isomorphism G : W — > W*. Given a map / : V — > W, we may define the 
adjoint of /, denoted by /, by 

G(w, fv) = g(v, fw) (2.17) 

where v e V and w e W. It is easy to see that (/) = /. The component 
expression of equation (2.17) is 

u/G^/V - JgijPaW 01 (2.18) 

where /^, and f 1 a are the matrix representations of / and f respectively. If 
g^ — Sjj and G a p = 8 a p, the adjoint / reduces to the transpose f l of the matrix 
/• 

Let us show that dimim / = dimim/. Since (2.18) holds for any v e V 
and w e W, we have G a p f^j = gij P that is 

f = g- l fG\ (2.19) 

Making use of the result of the following exercise, we obtain rank / = rank /, 
where the rank of a map is defined by that of the corresponding matrix (note that 
g e GL (m, TO) and G e GL («, K)). It is obvious that dimim / is the rank of a 
matrix representing the map / and we conclude dim im / = dim im /. 

Exercise 2. 1 0. Let V — V (m , K) and W = W(n, TO) and let / be a matrix 
corresponding to a linear map from V to W . Verify that rank / = rank/ 1 = 
rank(M/ ( V), where M e GL (m, K) and N e GL (n, TO). 

Exercise 2.11. Let V be a vector space over C The inner product of two vectors 
v 1 and V 2 is defined by 

g(V\,V2) = Vi l gijV2 j (2.20) 

where ~ denotes the complex conjugate. From the positivity and symmetry of the 
inner product, g(iq , ^ 2 ) = g(v 2 , v 1 ), the vector space isomorphism g : V -> V * 
is required to be a positive-definite Hermitian matrix. Let / : V -> W be a 
(complex) linear map and G : W -> W* be a vector space isomorphism. The 
adjoint of / is defined by g(v, fw) = G(w , fv). Repeat the analysis to show 
that 

(a) / = g~ l f t G ’ , where ' denotes the Hermitian conjugate, and 

(b) dimim / = dimim /. 

Theorem 2.2. (Toy index theorem) Let V and W be finite-dimensional vector 
spaces over a field K and let / : V -» W be a linear map. Then 


dimker / — dimker f = dim V — dim W. 


(2.21) 



Proof. Theorem 2.1 tells us that 


dim V — dim ker / + dim im / 

and, if applied to / : W — > V, 

dim W = dim ker / + dim im /. 

We saw earlier that dim im / = dim im /, from which we obtain 

dim V — dim ker / = dim W — dim ker /. □ 

Note that in (2.21), each term on the LHS depends on the details of the map 
/. The RHS states, however, that the difference in the two terms is independent of 
/ ! This may be regarded as a finite-dimensional analogue of the index theorems, 
see chapter 12. 

2.2.5 Tensors 

A dual vector is a linear object that maps a vector to a scalar. This may be 
generalized to multilinear objects called tensors, which map several vectors and 
dual vectors to a scalar. A tensor T of type ( p , q) is a multilinear map that maps 
p dual vectors and q vectors to ffi, 

p q 

T : 0 V* 0 V -> ffi. (2.22) 

For example, a tensor of type (0, 1) maps a vector to a real number and is 
identified with a dual vector. Similarly, a tensor of type (1,0) is a vector. If 
co maps a dual vector and two vectors to a scalar, co : V* x V x V — »■ IK, oo is of 
type (1, 2). 

The set of all tensors of type (p, q) is called the tensor space of type ( p. q ) 
and denoted by ‘Tq . The tensor product r = /i ® d 6 fj ® T 1 ’, is an element of 
7 P ^ P , defined by 

q+q' J 

t(<wi, . . . , co p , £i, . . . , £ p /; mi, ... , u q , vi,..., v q ') 

= n(w\, ...,a>p, mi, u q )v(f\, %p>; v\,..., ty). (2.23) 

Another operation in a tensor space is the contraction, which is a map from 
a tensor space of type ( p , q) to type (p — 1, q — 1) defined by 

T(..., (2.24) 

where {e, } and \e *' } are the dual bases. 

Exercise 2.12. Let V and W be vector spaces and let / : V — »■ W be a linear 
map. Show that / is a tensor of type (1, 1). 


2.3 Topological spaces 


The most general structure with which we work is a topological space. Physicists 
often tend to think that all the spaces they deal with are equipped with metrics. 
However, this is not always the case. In fact, metric spaces form a subset of 
manifolds and manifolds form a subset of topological spaces. 

2.3.1 Definitions 

Definition 2.3. Let X be any set and T = { U t \i e 1} denote a certain collection of 
subsets of X. The pair (X . T) is a topological space if T satisfies the following 
requirements. 

(i) 0,IeT. 

(ii) If T is any (maybe infinite) subcollection of 7, the family {Uj\j e J } 
satisfies U jejU j e T. 

(iii) If K is any finite subcollection of 7, the family {Uk\k e K] satisfies 

n ksKUic e T. 

X alone is sometimes called a topological space. The 77, are called the open 
sets and T is said to give a topology to X. 

Example 2. 7. (a) If X is a set and T is the collection of all the subsets of X , then 

(i)-(iii) are automatically satisfied. This topology is called the discrete topology. 

(b) Let X be a set and T = {0, X}. Clearly T satisfies (i)— (iii). This topology 
is called the trivial topology. In general the discrete topology is too stringent 
while the trivial topology is too trivial to give any interesting structures on X. 

(c) Let X be the real line iff. All open intervals (a. h) and their unions 

define a topology called the usual topology; a and b may be — oo and oo 
respectively. Similarly, the usual topology in IR" can be defined. [Take a product 
(fli, b\) x • • • x (a n , b n ) and their unions ] 

Exercise 2.13. In definition 2.3, axioms (ii) and (iii) look somewhat imbalanced. 
Show that, if we allow infinite intersection in (iii), the usual topology in M reduces 
to the discrete topology (and is thus not very interesting). 

A metric i/:XxX->iisa function that satisfies the conditions: 

(i) d(x, y) — d{y,x) 

(ii) d(x,y) > 0 where the equality holds if and only if x — y 

(iii) d(x, y) + d(y, z ) > d(x, z ) 

for any x, y, z e X. If X is endowed with a metric d, X is made into a topological 
space whose open sets are given by ‘open discs’, 


U s (20 = {}’ e X\d(x, y) < e} 


(2.25) 



and all their possible unions. The topology T thus defined is called the metric 
topology determined by d. The topological space (X, T) is called a metric space. 
[Exercise: Verify that a metric space (X, T) is indeed a topological space.] 

Let (X, T) be a topological space and A be any subset of X. Then T — {(/;} 
induces the relative topology in A by T 1 = {Ui C\ A\Uj e T). 

Example 2.8. Let X = M" +1 and take the n-sphere S n , 

(x 0 ) 2 + (x 1 ) 2 + • • • + (x' ! ) 2 = L (2.26) 

A topology in S n may be given by the relative topology induced by the usual 
topology on IR' ,+1 . 


2.3.2 Continuous maps 

Definition 2.4. Let X and Y be topological spaces. A map / : X —> Y is 
continuous if the inverse image of an open set in Y is an open set in X. 


This definition is in agreement with our intuitive notion of continuity. For 
instance, let / : M —> R be defined by 


/(*) = 


j-x + 1 

{ - X + \ 


x < 0 
x > 0. 


(2.27) 


We take the usual topology in !R, hence any open interval (a. b) is an open 
set. In the usual calculus, / is said to have a discontinuity at x = 0. For an 
open set (3/2, 2) C Y, we find / -1 (( 3/2, 2)) = (— 1, —1/2) which is an open 
set in X. If we take an open set (1 — 1/4, 1 + 1/4) C Y, however, we find 
/ -1 ((1 — 1/4, 1 + 1/4)) = (—1/4, 0] which is not an open set in the usual 
topology. 

Exercise 2.14. By taking a continuous function / : R — »• R, /(x) = x 2 as an 
example, show that the reverse definition, ‘a map f is continuous if it maps an 
open set in X to an open set in Y’ , does not work. [Hint: Find where (— e, +e) is 
mapped to under /.] 


2.3.3 Neighbourhoods and Hausdorff spaces 

Definition 2.5. Suppose T gives a topology to X . N is a neighbourhood of a 
point x e X if N is a subset of X and N contains some (at least one) open set Uj 
to which x belongs. (The subset N need not be an open set. If N happens to be 
an open set in T, it is called an open neighbourhood.) 

Example 2.9. Take X — with the usual topology. The interval [—1, 1] is a 
neighbourhood of an arbitrary point x e (—1, 1). 



Definition 2.6. A topological space (X ,'T) is a Hausdorff space if, for an 
arbitrary pair of distinct points x, x' e X, there always exist neighbourhoods 
\J x of x and U x ’ of x' such that U x fl U x ’ = 0. 

Exercise 2.15. Let X — {John, Paul, Ringo, George} and Uo = 0, U\ — 
{John}, U 2 = {John. Paul}, t /3 = {John, Paul, Ringo, George}. Show that T = 
{Uo. U 1 , t/ 2 , t/ 3 } gives a topology to X. Show also that (X, T ) is not a Hausdorff 
space. 

Unlike this exercise, most spaces that appear in physics satisfy the Hausdorff 
property. In the rest of the present book we always assume this is the case. 

Exercise 2.16. Show that R with the usual topology is a Hausdorff space. Show 
also that any metric space is a Hausdorff space. 

2.3.4 Closed set 

Let (X . T) be a topological space. A subset A of X is closed if its complement 
in X is an open set, that is X — A e T ■ According to the definition, X and 0 are 
both open and closed. Consider a set A (either open or closed). The closure of A 
is the smallest closed set that contains A and is denoted by A. The interior of A 
is the largest open subset of A and is denoted by A°. The boundary b(A) of A is 
the complement of A° in A; hi A) — A — A°. An open set is always disjoint from 
its boundary while a closed set always contains its boundary. 

Example 2.10. Take X — R with the usual topology and take a pair of open 
intervals (— 00 , a) and ( b , 00 ) where a < b. Since (— 00 , a) U ( b , 00 ) is open 
under the usual topology, the complement [a, b] is closed. Any closed interval 
is a closed set under the usual topology. Let A = (a. /;), then A = [a, b]. 
The boundary b(A) consists of two points {a, b\. The sets ( a , b). [a, b ], ( a , /;], 
and [a, b) all have the same boundary, closure and interior. In R" , the product 
[fli, b\] x • • • x [a n , b n \ is a closed set under the usual topology. 

Exercise 2.17. Whether a set A C X is open or closed depends on X. Let us take 
an interval 1 = (0, 1) in the x-axis. Show that I is open in the x-axis R while it 
is neither closed nor open in the xy-plane R 2 . 

2.3.5 Compactness 

Let (X, T ) be a topological space. A family {A,} of subsets of X is called a 
covering of X, if 

U A ‘ = X - 

iel 

If all the Ai happen to be the open sets of the topology T, the covering is called 

an open covering. 



Definition 2.7. Consider a set X and all possible coverings of X. The set X is 
compact if, for every open covering { LJ t \ i e /}, there exists a finite subset J of I 
such that { Uj \ j e /} is also a covering of X. 

In general, if a set is compact in W , it must be bounded. What else is 
needed? We state the result without the proof. 

Theorem 2.3. Let X be a subset of K" . X is compact if and only if it is closed and 
bounded. 

Example 2.11. (a) A point is compact. 

(b) Take an open interval (a , b) in TO and choose an open covering U n — 
(a, b — 1 /«), n e N. Evidently 


[J U n = (a, b). 

ne’L 

However, no finite subfamily of { U n ) covers (a, b). Thus, an open interval (a, b) 
is non-compact in conformity with theorem 2.3. 

(c) S' 1 in example 2.8 with the relative topology is compact, since it is closed 
and bounded in . 

The reader might not appreciate the significance of compactness from the 
definition and the few examples given here. It should be noted, however, that some 
mathematical analyses as well as physics become rather simple on a compact 
space. For example, let us consider a system of electrons in a solid. If the solid 
is non-compact with infinite volume, we have to deal with quantum statistical 
mechanics in an infinite volume. It is known that this is mathematically quite 
complicated and requires knowledge of the advanced theory of Hilbert spaces. 
What we usually do is to confine the system in a finite volume V surrounded by 
hard walls so that the electron wavefunction vanishes at the walls, or to impose 
periodic boundary conditions on the walls, which amounts to putting the system in 
a torus, see example 2.5(b). In any case, the system is now put in a compact space. 
Then we may construct the Fock space whose excitations are labelled by discrete 
indices. Another significance of compactness in physics will be found when we 
study extended objects such as instantons and Belavin-Polyakov monopoles, see 
section 4.8. In field theories, we usually assume that the field approaches some 
asymptotic form corresponding to the vacuum (or one of the vacua) at spatial 
infinities. Similarly, a class of order parameter distributions in which the spatial 
infinities have a common order parameter is an interesting class to study from 
various points of view as we shall see later. Since all points at infinity are 
mapped to a point, we have effectively compactified the non-compact space TO" 
to a compact space S' 1 — IR" U {oo}. This procedure is called the one-point 
compactification. 



2.3.6 Connectedness 


Definition 2.8. (a) A topological space X is connected if it cannot be written as 
X = X\ U X 2 , where Xi and X 2 are both open and Xi fl X 2 = 0. Otherwise X 

is called disconnected. 

(b) A topological space X is called arcwise connected if, for any points 
x, y e X, there exists a continuous map / : [0, 1] — >• X such that /( 0) = x 
and /( 1) = y. With a few pathological exceptions, arcwise connectedness is 
practically equivalent to connectedness. 

(c) A loop in a topological space X is a continuous map / : [0, 1] -> X 
such that /( 0) = /(I ). If any loop in X can be continuously shrunk to a point, X 
is called simply connected. 

Example 2.12. (a) The real line ffi is arcwise connected while E — {0} is not. 
E" (n > 2) is arcwise connected and so is E" — {0}. 

(b) S n is arcwise connected. The circle .S' 1 is not simply connected. If n > 2, 
S' ' is simply connected. The ^-dimensional torus 

T n = S l x S l x • • • x S l (n > 2) 

' Y 

n 

is arcwise connected but not simply connected. 

(c) E 2 — E is not arcwise connected. E 2 — {0} is arcwise connected but not 
simply connected. E 3 — {0} is arcwise connected and simply connected. 

2.4 Homeomorphisms and topological invariants 
2.4.1 Homeomorphisms 

As we mentioned at the beginning of this chapter, the main purpose of topology 
is to classify spaces. Suppose we have several figures and ask ourselves which 
are equal and which are different. Since we have not defined what is meant by 
equal or different, we may say 'they are all different from each other’ or ‘they 
are all the same figures’. Some of the definitions of equivalence are too stringent 
and some are too loose to produce any sensible classification of the figures or 
spaces. For example, in elementary geometry, the equivalence of figures is given 
by congruence, which turns out to be too stringent for our purpose. In topology, 
we define two figures to be equivalent if it is possible to deform one figure into the 
other by continuous deformation. Namely we introduce the equivalence relation 
under which geometrical objects are classified according to whether it is possible 
to deform one object into the other by continuous deformation. To be more 
mathematical, we need to introduce the following notion of homeomorphism. 

Definition 2.9. Let X \ and A 2 be topological spaces. A map / : X\ — > Xi is a 
homeomorphism if it is continuous and has an inverse / 1 : AL — » A 1 which is 



Figure 2.10. (a) A coffee cup is homeomorphic to a doughnut, (b) The linked rings are 
homeomorphic to the separated rings. 

also continuous. If there exists a homeomorphism between X \ and X2, X 1 is said 
to be homeomorphic to X2 and vice versa. 

In other words, X\ is homeomorphic to X2 if there exist maps f : X \ —> X2 
and g : X2 — > Xi such that fog = idx 2 , and g o f = idx ( . It is easy to show that 
a homeomorphism is an equivalence relation. Reflectivity follows from the choice 
/ = idx, while symmetry follows since if / : X \ Xi is a homeomorphism 

so is / _1 : X2 — > X\ by definition. Transitivity follows since, if / : Zi — > X2 
and g : X2 — >■ X3 are homeomorphisms so is g o / : X\ -> Z3. Now we divide 
all topological spaces into equivalence classes according to whether it is possible 
to deform one space into the other by a homeomorphism. Intuitively speaking, 
we suppose the topological spaces are made out of ideal rubber which we can 
deform at our will. Two topological spaces are homeomorphic to each other if we 
can deform one into the other continuously, that is, without tearing them apart or 
pasting. 

Figure 2.10 shows some examples of homeomorphisms. It seems impossible 
to deform the left figure in figure 2.10(Z?) into the right one by continuous 
deformation. However, this is an artefact of the embedding of these objects 
in If 3 . In fact, they are continuously deformable in If 4 , see problem 2.3. To 
distinguish one from the other, we have to embed them in ,S’ 3 , say, and compare 
the complements of these objects in .S' 3 . This approach is, however, out of the 
scope of the present book and we will content ourselves with homeomorphisms. 

2.4.2 Topological invariants 

Now our main question is: ‘ How can we characterize the equivalence classes 
of homeomorphism?’ In fact, we do not know the complete answer to this 
question yet. Instead, we have a rather modest statement, that is, if two spaces 
have different ‘topological invariants’, they are not homeomorphic to each 
other. Here topological invariants are those quantities which are conserved under 
homeomorphisms. A topological invariant may be a number such as the number 
of connected components of the space, an algebraic structure such as a group or 


a ring which is constructed out of the space, or something like connectedness, 
compactness or the Hausdorff property. (Although it seems to be intuitively 
clear that these are topological invariants, we have to prove that they indeed 
are. We omit the proofs. An interested reader may consult any text book on 
topology. ) If we knew the complete set of topological invariants we could specify 
the equivalence class by giving these invariants. However, so far we know a partial 
set of topological invariants, which means that even if all the known topological 
invariants of two topological spaces coincide, they may not be homeomorphic to 
each other. Instead, what we can say at most is: if two topological spaces have 
different topological invariants they cannot be homeomorphic to each other. 

Example 2.13. (a) A closed line [—1, 1] is not homeomorphic to an open line 
(— 1, 1), since [—1, 1] is compact while (—1, 1) is not. 

(b) A circle S l is not homeomorphic to R, since S l is compact in R 2 while 
R is not. 

(c) A parabola (y = x 2 ) is not homeomorphic to a hyperbola (x 2 — y 2 = 1) 
although they are both non-compact. A parabola is (arcwise) connected while a 
hyperbola is not. 

(d) A circle ,S’ 1 is not homeomorphic to an interval [—1, 1], although they 
are both compact and (arcwise) connected. [—1, 1] is simply connected while 
.S’ 1 is not. Alternatively .S' 1 — {p}, p being any point in .S 1 is connected while 
[—1,1] — {0} is not, which is more evidence against their equivalence, 

(e) Surprisingly, an interval without the endpoints is homeomorphic to a line 
R. To see this, let us take X — (—jt/2, rt / 2) and Y = R and let / : X — > Y be 
f(x) — tan v. Since tan x is one to one on X and has an inverse, tan -1 x, which 
is one to one on R, this is indeed a homeomorphism. Thus, boundedness is not a 
topological invariant. 

(f) An open disc D 2 — {(.r, y) e R 2 \x 2 + y 2 < 1} is homeomorphic to R 2 . 
A homeomorphism / : D 2 — > R 2 may be 


f(x, y ) = 



while the inverse / 1 : R 2 — > D 2 is 


(2.28) 




x y \ 

yi+v 2 + y 2 y/l + x 2 + y 2 ) 


(2.29) 


The reader should verify that / o /“* = id K 2 , and f~ x o f = id D 2 . As we 
saw in example 2.5(e), a closed disc whose boundary .S' 1 corresponds to a point 
is homeomorphic to S 2 . If we take this point away, we have an open disc. The 
present analysis shows that this open disc is homeomorphic to R 2 . By reversing 
the order of arguments, we find that if we add a point (infinity) to R 2 , we obtain 
a compact space S 2 . This procedure is the one-point compactification S 2 — 
R 2 U {oo} introduced in the previous section. We similarly have S" = R" U {oo}. 



(g) A circle S l = {(x, y) e E 2 |x 2 + y 2 = 1} is homeomorphic to a square 
/ 2 = {(x,y) e M 2 1 ( | x | = 1, |y| < 1), (|x| < 1, |y| = 1)}. A homeomorphism 
/ : I 2 -» S 1 may be given by 

fix, y) = ^ r = yjx 2 + y 2 . (2.30) 

Since r cannot vanish, (2.27) is invertible. 

Exercise 2.18. Find a homeomorphism between a circle S 1 = {(x, y) e E 2 |x 2 + 
y 2 = 1} and an ellipse E = {(x, y) e E 2 |(x/a) 2 + (y/b) 2 = 1}. 

2.4.3 Homotopy type 

An equivalence class which is somewhat coarser than homeomorphism but which 
is still quite useful is ‘of the same homotopy type’. We relax the conditions in 
definition 2.9 so that the continuous functions / or g need not have inverses. For 
example, take X — (0, 1) and Y = {0} and let / : X — > Y, /(x) = 0 and 
g : Y — > X, g(0) — Then fog — idy, while go / / idx- This shows that an 
open interval (0, 1) is of the same homotopy type as a point {0}, although it is not 
homeomorphic to {0}. We have more on this topic in section 4.2. 

Example 2.14. (a) .S' 1 is of the same homotopy type as a cylinder, since a cylinder 
is a direct product S 1 xE and we can shrink E to a point at each point of .S’ 1 . By 
the same reason, the Mobius strip is of the same homotopy type as .S 1 . 

(b) A disc D 2 = {(x, y) e E 2 |x 2 + y 2 < 1} is of the same homotopy type 
as a point. D 2 — { (0, 0)} is of the same homotopy type as S 1 . Similarly, E 2 — { 0 } 
is of the same homotopy type as S l and E 3 — { 0 } as S 2 . 

2.4.4 Euler characteristic: an example 

The Euler characteristic is one of the most useful topological invariants. 
Moreover, we find the prototype of the algebraic approach to topology in it. To 
avoid unnecessary complication, we restrict ourselves to points, lines and surfaces 
in E 3 . A polyhedron is a geometrical object surrounded by faces. The boundary 
of two faces is an edge and two edges meet at a vertex. We extend the definition 
of a polyhedron a bit to include polygons and the boundaries of polygons, lines or 
points. We call the faces, edges and vertices of a polyhedron simplexes. Note that 
the boundary of two simplexes is either empty or another simplex. (For example, 
the boundary of two faces is an edge.) Formal definitions of a simplex and a 
polyhedron in a general number of dimensions will be given in chapter 3. We are 
now ready to define the Euler characteristic of a figure in E 3 . 

Definition 2.10. Let X be a subset of E 3 , which is homeomorphic to a polyhedron 
K. Then the Euler characteristic / (X ) of X is defined by 

X ( X ) = (number of verticies in K) — (number of edges in K) 

+ (number of faces in K). (2.31) 


(a) (b) 
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Figure 2.11. Example of a polyhedron which is homeomorphic to a torus. 


The reader might wonder if x (X) depends on the polyhedron K or not. The 
following theorem due to Poincare and Alexander guarantees that it is, in fact, 
independent of the polyhedron K. 

Theorem 2.4. (Poincare-Alexander) The Euler characteristic y (X) is indepen- 
dent of the polyhedron K as long as K is homeomorphic to X. 

Examples are in order. The Euler characteristic of a point is /(■) = 1 by 

definition. The Euler characteristic of a line is / ( ) = 2—1 = 1, since a 

line has two vertices and an edge. For a triangular disc, we find x (triangle) = 
3 — 3 + 1 = 1 . An example which is a bit non-trivial is the Euler characteristic of 
S 1 . The simplest polyhedron which is homeomorphic to S 1 is made of three edges 
of a triangle. Then x (S' 1 ) =3 — 3 = 0. Similarly, the sphere S 2 is homeomorphic 
to the surface of a tetrahedron, hence x (-S' 2 ) = 4 — 6 + 4 = 2. It is easily seen 
that S 2 is also homeomorphic to the surface of a cube. Using a cube to calculate 
the Euler characteristic of S 2 , we have x (.S' 2 ) = 8 — 12 + 6 = 2, in accord with 
theorem 2.4. Historically this is the conclusion of Euler’s theorem: if K is any 
polyhedron homeomorphic to S 2 , with v vertices, e edges and / two-dimensional 
faces, then v — e + f = 2. 

Example 2.15. Let us calculate the Euler characteristic of the torus T 2 . 
Figure 2.11(a) is an example of a polyhedron which is homeomorphic to T 2 . 
From this polyhedron, we find x{T 2 ) = 16 — 32 + 16 = 0. As we saw 
in example 2.5(b), T 2 is equivalent to a rectangle whose edges are identified; 
see figure 2.4. Taking care of this identification, we find an example of a 
polyhedron made of rectangular faces as in figure 2.11 (£>), from which we also 
have x (T 2 ) = 0. This approach is quite useful when the figure cannot be realized 
(embedded) in ffi 3 . For example, the Klein bottle (figure 2.5(a)) cannot be realized 
in ffi 3 without intersecting itself. From the rectangle of figure 2.5(a), we find 
X (Klein bottle) = 0. Similarly, we have x (projective plane) = 1. 



Exercise 2.19. (a) Show that x(Mobius strip) = 0. 

(b) Show that x(^ 2 ) = —2, where £2 is the torus with two handles (see 
example 2.5). The reader may either construct a polyhedron homeomorphic to £2 
or make use of the octagon in figure 2.6(a). We show later that / (£ ? ) — 2 — 2 g, 
where £ g is the torus with g handles. 

The connected sum X$Y of two surfaces X and Y is a surface obtained by 
removing a small disc from each of X and Y and connecting the resulting holes 
with a cylinder; see figure 2.12. Let X be an arbitrary surface. Then it is easy to 
see that 

S 2 ttX = X (2.32) 

since S 2 and the cylinder may be deformed so that they fill in the hole on X ; see 
figure 2.12(a). If we take a connected sum of two tori we get (figure 2.12(h)) 

T 2 #T 2 = £ 2 . (2.33) 

Similarly, £ g may be given by the connected sum of g tori, 

r 2 tjr 2 tt---ur 2 = £ g . (2.34) 

\- ^ 

g factors 

The connected sum may be used as a trick to calculate an Euler characteristic 
of a complicated surface from those of known surfaces. Let us prove the following 
theorem. 

Theorem 2.5. Let X and Y be two surfaces. Then the Euler characteristic of the 
connected sum X#Y is given by 


X(XttT) = X(X) + x(T)-2. 


Proof. Take polyhedra Kx and Ky homeomorphic to X and Y . respectively. We 
assume, without loss of generality, that each of Ky and Ky has a triangle in it. 
Remove the triangles from them and connect the resulting holes with a trigonal 
cylinder. Then the number of vertices does not change while the number of edges 
increases by three. Since we have removed two faces and added three faces, 
the number of faces increases by —2 + 3=1. Thus, the change of the Euler 
characteristic is 0 — 3 + 1 = —2. □ 

From the previous theorem and the equality y (T 2 ) = 0, we obtain x (2+) = 
0 + 0 — 2 = —2 and x(^g) — 8 x 0 — 2(g — 1) = 2 — 2 g, cf exercise 2.19(b). 

The significance of the Euler characteristic is that it is a topological invariant, 
which is calculated relatively easily. We accept, without proof, the following 
theorem. 

Theorem 2.6. Let X and Y be two figures in If 3 . If X is homeomorphic to Y, then 
/(X) = x(T). In other words, if x(2f) ^ x(Y), X cannot be homeomorphic to 
Y. 

Example 2.16. (a) S’ 1 is not homeomorphic to S 2 , since /(S 1 ) = 0 while 
X(S 2 ) = 2. 

(b) Two figures, which are not homeomorphic to each other, may have the 

same Euler characteristic. A point (•) is not homeomorphic to a line ( ) but 

X(-) = x( ) = 1. This is a general consequence of the following fact: if a 

figure X is of the same homotopy type as a figure Y , then x(X) = x(X). 

The reader might have noticed that the Euler characteristic is different from 
other topological invariants such as compactness or connectedness in character. 
Compactness and connectedness are geometrical properties of a figure or a space 
while the Euler characteristic is an integer / (X) e Z. Note that Z is an 
algebraic object rather than a geometrical one. Since the work of Euler, many 
mathematicians have worked out the relation between geometry and algebra 
and elaborated this idea, in the last century, to establish combinatorial topology 
and algebraic topology. We may compute the Euler characteristic of a smooth 
surface by the celebrated Gauss-Bonnet theorem, which relates the integral of 
the Gauss curvature of the surface with the Euler characteristic calculated from 
the corresponding polyhedron. We will give the generalized form of the Gauss- 
Bonnet theorem in chapter 12. 

Problems 

2.1 Show that the 4g-gon in figure 2.13(a), with the boundary identified, 
represents the torus with genus g of figure 2.13 fib). The reader may use 
equation (2.34). 

2.2 Let X = {1, 1/2, ... , l/n, . . .} be a subset of EL Show that X is not closed in 
EL Show that Y = {1 , 1/2, . . . , l/n, . . . , 0} is closed in EL hence compact. 




Figure 2.13. The polygon (a) whose edges are identified is the torus Eg with genus g. 

2.3 Show that two figures in figure 2. 109(Z?) are homeomorphic to each other. Find 
how to unlink the right figure in K 4 . 

2.4 Show that there are only five regular polyhedra: a tetrahedron, a hexahedron, 
an octahedron, a dodecahedron and an icosahedron. [Hint. Use Euler’s theorem.] 
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HOMOLOGY GROUPS 


Among the topological invariants the Euler characteristic is a quantity readily 
computable by the ‘polyhedronization’ of space. The homology groups are 
refitiements, so to speak, of the Euler characteristic. Moreover, we can easily read 
off the Euler characteristic from the homology groups. Let us look at figure 3.1. 
In figure 3.1(a), the interior is included but not in figure 3.1(b). How do we 
characterize this difference? An obvious observation is that the three edges of 
figure 3.1(a) form a boundary of the interior while the edges of figure 3.1(h) do 
not (the interior is not a part of figure 3.1(h)). Clearly the edges in both cases 
form a closed path (loop), having no boundary. In other words, the existence of 
a loop that is not a boundary of some area implies the existence of a hole within 
the loop. This is our guiding principle in classifying spaces here: find a region 
without boundaries, which is not itself a boundary of some region. This principle 
is mathematically elaborated into the theory of homology groups. 

Our exposition follows Armstrong (1983), Croom (1978) and Nash and Sen 
(1983). An introduction to group theory is found in Fraleigh (1976). 

3.1 Abelian groups 

The mathematical structures underlying homology groups are finitely generated 
Abelian groups. Throughout this chapter, the group operation is denoted by + 
since all the groups considered here are Abelian (commutative). The unit element 
is denoted by 0. 

3.1.1 Elementary group theory 

Let G i and G 2 be Abelian groups. A map / : G 1 — »■ G 2 is said to be a 

homomorphism if 

fix + y) = f(x) + f(y) (3.1) 

for any x,y e G\. If / is also a bijection, f is called an isomorphism. If there 
exists an isomorphism / : G 1 — »■ G 2 , G 1 is said to be isomorphic to G 2 , denoted 
by G 1 = Gj- For example, a map / : Z — > Z 2 = {0, 1} defined by 


f(2n) = 0 f(2n + 1) = 1 




Figure 3.1. (a) is a solid triangle while ( b ) is the edges of a triangle without an interior. 


is a homomorphism. Indeed 

/ (2m + 2 n) = /(2(m + n)) = 0 = 0 + 0 = /(2m) + f(2n) 
/(2m + 1 + 2n + 1) = /(2(m + n + 1)) = 0 = I + 1 
= /(2m + 1) + /(2n + 1) 

/(2m + 1 + 2n) = f(2(m + n) + I) = 1 = I + 0 
= /(2m + I) + f(2n). 

A subset // C G is a subgroup if it is a group with respect to the group 
operation of G. For example, 

kZ = {kn\n e Zj k e N 

is a subgroup of Z, while Z 2 = {0, 1} is not. 

Let H be a subgroup of G. We say x, y e G are equivalent if 

x-y e H (3.2) 

and write x ~ y. Clearly ~ is an equivalence relation. The equivalence class to 
which x belongs is denoted by [x]. Let G/H be the quotient space. The group 
operation + in G naturally induces the group operation + in G/H by 

[xi + [y] = [x + y]. (3.3) 

Note that + on the LHS is an operation in G/H while + on the RHS is that in G. 
The operation in G/H should be independent of the choice of representatives. In 
fact, if [./] = [x], [y'l = [v], then x — x' = h,y — y' = g for some h, g e H 
and we find that 

x' + y' = x + y - (h + g) e [x + y] 

Furthermore, G/H becomes a group with this operation, since H is always a 
normal subgroup of G; see example 2.6. The unit element of G/H is [0] = [/?], 



h e H . If H = G, 0 — x e G for any x e G and G/G has just one element [0]. 
If H — {0}, G/H is G itself since x — y — 0 if and only if x — y. 

Example 3.1. Let us work out the quotient group Z/2Z. For even numbers 
we have 2 n — 2m = 2 (« — m) e 2Z and [2m] — [2 n]. For odd numbers 
(2n + l) — (2m+l) = 2 (n — m) e 2Zand[2;n + l] = [2n+l]. Even numbers and 
odd numbers never belong to the same equivalence class since 2n — (2 m + 1 ) / 2Z . 
Thus, it follows that 

Z/2Z={[0],[1]}. (3.4) 

If we define an isomorphism <p : Z /2Z — > Z 2 by <p([0]) = 0 and (pi[ 1]) = 1, we 
find Z /2Z = Z 2 . For general k e N, we have 

Z/kZ=Z k . (3.5) 

Lemma 3.1. Let / : G 1 —> G 2 be a homomorphism. Then 

(a) ker f — [x\x e G\, fix) — 0} is a subgroup of G 1 , 

(b) im / = { x\x e f(G 1 ) c G 2 } is a subgroup of G 2 . 

Proof, (a) Let x, y e ker /. Then x + y e ker / since f(x + y) = fix) + fiy) — 
0 + 0 = 0. Note that 0 e ker / for /(0) = /( 0) + /( 0). We also have — x e ker / 
since /( 0) = fix - x) = fix) + fi~x) = 0. 

(b) Let yi — fix 1 ), yi = fix 2 ) e im / where xi, X 2 e Gi. Since / is a 
homomorphism we have vi + V 2 = /(xi) + /G 2 ) = fix 1 +JC 2 ) e im /. Clearly 


0 e im / since /(0) = 0. If y = / (x), — y e im/ since 0 = fix — x) = 
fix ) + /(-*) implies f(-x) = -y. □ 

Theorem 3.1. (Fundamental theorem of homomorphism) Let / : G \ G 2 

be a homomorphism. Then 

Gi/ker/ = im /. (3.6) 


Proof. Both sides are groups according to lemma 3.1. Define a map <p : 
Gi/ker/ — >• im / by <^([x]) = /(.r). This map is well defined since for 
x' e [x], there exists h e ker / such that x' = x + h and fix') = fix + h) — 
fix) + fih) — fix). Now we show that <p is an isomorphism. First, <p is a 
homomorphism, 

(pi W + [y]) = 1 pi[x + y]) = fix + y) 

= fix) + fiy) = Vi[x]) + cpi[y]). 

Second, <p is one to one: if (pi[x]) — <p([y]), then fix) — fiy) or fix) — fiy) — 
fix — y) = 0. This shows thatx — ye ker / and [x] — [y] . Finally, tp is onto: 
if y e im /, there exists x e G 1 such that fix) = y = (pi[x]). □ 

Example 3.2. Let / : Z — > Z 2 be defined by /(2n) = 0 and /(2n+ 1 ) = 1. Then 
ker / = 2Z and im / = Z 2 are groups. Theorem 3.1 states that Z/2Z = Z 2 , in 
agreement with example 3.1. 



3.1.2 Finitely generated Abelian groups and free Abelian groups 

Let x be an element of a group G. For n e Z,nx denotes 

x + • • • + x (if n > 0) 

n 


and 

(— x)H b (-x) (if n < 0). 

^ v ^ 

I" I 

If n — 0, we put Ox = 0. Take r elements xi, . . . , x r of G. The elements of G of 
the form 

n\X\ + ■ ■ ■ + n r x r ( rij e Z, 1 < i < r) (3.7) 

form a subgroup of G, which we denote H. H is called a subgroup of G 
generated by the generators x\,...,x r . If G itself is generated by finite 
elements x\ , . . . ,x r , G is said to be finitely generated. If n\x\ + • • • + n r x r — 0 
is satisfied only when n\ — • • • = n r — 0, x\, . . . , x r are said to be linearly 
independent. 

Definition 3.1. If G is finitely generated by r linearly independent elements, G is 
called a free Abelian group of rank r. 

Example 3.3. Z is a free Abelian group of rank 1 finitely generated by 1 (or — 1 ). 
Let Z ® Z be the set of pairs {(i, j)\i, j e Z}. It is a free Abelian group of rank 2 
finitely generated by generators (1,0) and (0, 1). More generally 


z ® z ® • • • ® z 

^ V ^ 

r 

is a free Abelian group of rank r . The group Z 2 = {0, 1} is finitely generated by 
1 but is not free since 1 is not linearly independent (note 1 + 1 = 0). 

3.1.3 Cyclic groups 

If G is generated by one element x, G = {0, ±x, ±2x, . . .}, G is called a cyclic 
group. If nx / 0 for any n e Z — {0}, it is an infinite cyclic group while if 
nx — 0 for some n e Z — {0}, a finite cyclic group. Let G be a cyclic group 
generated by x and let / : Z -* G be a homomorphism defined by /(«) = nx. 
f maps Z onto G but not necessarily one to one. From theorem 3.1, we have 
G = im / = Z / ker /. Let N be the smallest positive integer such that Nx = 0. 
Clearly 

ker / = {0, ±N, ±2 N, . . .} = NZ (3.8) 

G = Z / N7h = Z xt- 


and we have 


(3.9) 



If G is an infinite cyclic group, then ker f — {0} and G = Z. Any infinite cyclic 
group is isomorphic to Z while a finite cyclic group is isomorphic to some Z a?. 

We will need the following lemma and theorem in due course. We first state 
the lemma without proof. 

Lemma 3.2. Let G be a free Abelian group of rank r and let H (^0) be a subgroup 
of G. We may always choose p generators xi, . . . , x p , out of r generators of G 
so that k\x \, . . . , kpXp generate H . Thus, H = k\L © ... © k p Z and H is of 
rank p. 

Theorem 3.2. (Fundamental theorem of finitely generated Abelian groups) 

Let G be a finitely generated Abelian group (not necessarily free) with m 
generators. Then G is isomorphic to the direct sum of cyclic groups, 

G = Z © ■ - ■ © Z ®Z kl © ■ • ■ © Z kp (3.10) 

r 

where m = r + p. The number r is called the rank of G. 

Proof. Let G be generated by m elements x\. ... ,x m and let 

/ : Z © • • • © Z — >• G 

m 


be a surjective homomorphism, 


/(hi, . . . , n m ) = n\x\ H b n m x m . 


Theorem 3.1 states that 


Z © • ■ ■ © Z / ker / = G. 

m 


Since ker / is a subgroup of 



m 


lemma 3.2 claims that if we choose the generators properly, we have 


ker / = kiZ © • • • © k p Z. 


We finally obtain 


G ^ Z © ■ ■ ■ © Z /ker/ ^ Z © • • ■ © Z /(kiZ © ■ ■ ■ © k p Z) 

m m 

= Z ® • • • 0 Z ® • • • (B 7L kp' 


□ 


m—p 




Figure 3.2. 0-, 1-, 2- and 3-simplexes. 


3.2 Simplexes and simplicial complexes 

Let us recall how the Euler characteristic of a surface is calculated. We first 
construct a polyhedron homeomorphic to the given surface, then count the 
numbers of vertices, edges and faces. The Euler characteristic of the polyhedron, 
and hence of the surface, is then given by equation (2.31). We abstract this 
procedure so that we may represent each part of a figure by some standard object. 
We take triangles and their analogues in other dimensions, called simplexes, as 
the standard objects. By this standardization, it becomes possible to assign to 
each figure Abelian group structures. 


3.2.1 Simplexes 

Simplexes are building blocks of a polyhedron. A 0-simplex (po) is a point, or 
a vertex, and a 1-simplex (popi) is a line, or an edge. A 2-simplex ( P 0 P 1 P 2 ) is 
defined to be a triangle with its interior included and a 3-simplex {popipipi,) is 
a solid tetrahedron (figure 3.2). It is common to denote a 0-simplex without the 
bracket; (po) may be also written as po- It is easy to continue this construction 
to any /--simplex (popi . . . p r ). Note that for an /--simplex to represent an r- 
dimensional object, the vertices p; must be geometrically independent , that is, no 
(r — l)-dimensional hyperplane contains all the r + 1 points. Let po, .... p r 
be points geometrically independent in where /// > r. The /--simplex 
oy — (po, . . . , p r ) is expressed as 


x e 


IT 


= J2 c ‘Pi’ 

i '=0 


>o,X> = il. 

;=0 J 


(3.11) 


(co, . . . , c r ) is called the barycentric coordinate of x . Since ay is a bounded and 
closed subset of K'” , it is compact. 

Let q be an integer such that 0 < q < r. If we choose q + 1 points 
Pi 0 , . . . , pi out of po, . . . , p r , these q + 1 points define a ^-simplex a q — 
(pi 0 , . . . , pi q ), which is called a <7 -face of o r . We write a q < ay if a q is a face of 



Pi 



Figure 3.3. A O-face pq and a 2-face ( P 1 P 2 P 3 ) °f a 3-simplex (P0P1P2P3)- 

o>. If a q ^ a,-, we say er^ is a proper face of <j r , denoted as a cj < oy. Figure 3.3 
shows a 0-face po and a 2-face {pipipi) of a 3-simplex (poPiP 2 P 3 )- There are 
one 3-face, four 2-faces, six I-faces and four 0-faces. The reader should verify 



that the number of <7 -faces in an r-simplex is 


. A 0-simplex is defined 


to have no proper faces. 

3.2.2 Simplicial complexes and polyhedra 

Let K be a set of finite number of simplexes in IIP . If these simplexes are nicely 
fitted together, K is called a simplicial complex. By ‘nicely’ we mean: 

(i) an arbitrary face of a simplex of K belongs to K, that is, if <7 e K and 
a' < a then a' e K\ and 

(ii) if a and a' are two simplexes of K, the intersection a n <r' is either empty 
or a common face of a and a' , that is, if a, a' e K then either a fl a' = 0 
or a fl a' < a and a fl a' < a' . 

For example, figure 3.4(a) is a simplicial complex but figure 3.4(h) is not. 
The dimension of a simplicial complex K is defined to be the largest dimension 
of simplexes in K. 

Example 3.4. Let o r be an r- simplex and K = {cr'|cr'' < ct, } be the set of 
faces of a r . K is an r-dimensional simplicial complex. For example, take 




Figure 3.4. (a) is a simplicial complex but ( b ) is not. 


a -3 = {popiP 2 P 3 ) (figure 3.3). Then 

K = {po, pu P 2 , P3, (POPl), ( POP 2 ), iPOPl), 

{PlPl), (PlP3), ( P2P3 ), (P0PIP2), (P0P1P3), 

(P0P2P3), {PIP2P3), {POPIP2P3)}- (3.12) 

A simplicial complex A" is a set whose elements are simplexes. If each 
simplex is regarded as a subset of M" ! (m > dim K), the union of all the simplexes 
becomes a subset of ffi" ! . This subset is called the polyhedron K of a simplicial 
complex K. The dimension of | K as a subset of TO"' is the same as that of K ; 
dim | K | = dim K. 

Let X be a topological space. If there exists a simplicial complex K and a 
homeomorphism f : \K\ X, X is said to be triangulable and the pair ( K . f) 
is called a triangulation of X. Given a topological space X, its triangulation is 
far from unique. We will be concerned with triangulable spaces only. 

Example 3.5. Figure 3.5 (a) is a triangulation of a cylinder S l x [0, 1]. The reader 
might think that somewhat simpler choices exist, figure 3.5(b), for example. This 
is, however, not a triangulation since, for <72 = (P 0 P 1 P 2 ) andcrj = (P2P3Po), we 
find <72 Fl a' 2 — (po) U (pi), which is neither empty nor a simplex. 


3.3 Homology groups of simplicial complexes 
3.3.1 Oriented simplexes 

We may assign orientations to an /--simplex for r > 1. Instead of (. . .) for an 
unoriented simplex, we will use (. . .) to denote an oriented simplex. The symbol 
o r is used to denote both types of simplex. An oriented 1 -simplex cri = (pop 1 ) is 
a directed line segment traversed in the direction po —*■ p\ (figure 3.6(a)). Now 



Figure 3.5. (a) is a triangulation of a cylinder while (b) is not. 



Figure 3.6. An oriented 1 -simplex (a) and an oriented 2-simplex ( b ). 


( po p i ) should be distinguished from ( p\ po). We require that 

(popi) = -(pipo)- (3.13) 

Here ’ in front of ( pi po) should be understood in the sense of a finitely 
generated Abelian group. In fact, (pipo) is regarded as the inverse of (popi). 
Going from po to p\ followed by going from p\ to po means going nowhere, 
(POPl) + (PlPo) = 0, hence -(pipo) = (poPi)- 

Similarly, an oriented 2-simplex en = (popipi) is a triangular region 
Po P 1 P 2 with a prescribed orientation along the edges (figure 3.6 (b)). Observe that 
the orientation given by popipi is the same as that given by pipoPi or p \ p 2 po 
but opposite to popipi , P 2 P 1 P 0 or PiPoPi- We require that 

(popipi) = (PiPoPi) = (pi P 2 P 0 ) 

— - (popipi) = -(pipipo) = -(PIPOPI). 

Let P be a permutation of 0, 1,2 

P = 

These relations are summarized as 



(PiPjPk) = sgn(P)(p 0 pipi) 





where sgn( P) = +1 (—1) if P is an even (odd) permutation. 

An oriented 3-simplex 03 = ( po p 1 P 2 P 3 ) is an ordered sequence of four 
vertices of a tetrahedron. Let 


P = 


0 1 2 3 \ 

1 j k l ) 


be a permutation. We define 

( PiPjPkPi ) = sgn(P)(p 0 pi P2P3)- 

It is now easy to construct an oriented r-simplex for any r > 1. The 
formal definition goes as follows. Take r + 1 geometrically independent points 
po- pi - . . . , p r in IK'” . Let { p, 0 , p ;i , . . . , p, r } be a sequence of points obtained by 
a permutation of the points po, . . . , p r . We define {po, . . . , p, } and { p, 0 , . . . , p, r } 
to be equivalent if 

p=(? 1 - r 

y ?0 l l • • • 

is an even permutation. Clearly this is an equivalence relation, the equivalence 
class of which is called an oriented r-simplex. There are two equivalence 
classes, one consists of even permutations of po, ...,p r , the other of odd 
permutations. The equivalence class (oriented r-simplex) which contains 
{po, . . . , p r ] is denoted by a r = (popi . . . p r ), while the other is denoted by 
—o,- — — (popi • • • p r )- In other words, 

(PioPh ■ ■ ■ Pi r ) = sgn(P)(p 0 pi . . . p r ). (3.14) 

For r = 0, we formally define an oriented 0-simplex to be just a point 
op = Po- 


3.3.2 Chain group, cycle group and boundary group 

Let K = \o u \ be an n-dimensional simplicial complex. We regard the simplexes 
o a in K as oriented simplexes and denote them by the same symbols op as 
remarked before. 

Definition 3.2. The r-chain group C, (K) of a simplicial complex K is a free 
Abelian group generated by the oriented r-simplexes of K. If r > dim K, C r (K) 
is defined to be 0. An element of C r {K) is called an r-chain. 

Let there be /, r-simplexes in K. We denote each of them by a r j (1 < i < 
I r ). Then c e C, (K) is expressed as 

lr 

c — ^2 CjCff.i 
1=1 


C[ G Z. 


(3.15) 




Figure 3.7. («) An oriented 1-simplex with a fictitious boundary p \ . ( b ) A simplicial 
complex without a boundary. 

The integers c; are called the coefficients of c. The group structure is given as 
follows. The addition of two /--chains, c — JL cyoy.; and c' = JL cloy.;, is 

C + c' — Y+Ci + c 'd a r,i ■ (3.16) 

i 

The unit element is 0 = 0 • ay j, while the inverse element of c is — c — 

'}2 i (—Ci)(j r j. [ Remark : An oppositely oriented / -simplex —a,, is identified with 
(— l)o> e C r (K ).] Thus, C r (K) is a free Abelian group of rank 

C r (K) = Z © Z © • • • © Z . (3.17) 

^ v ^ 

Ir 

Before we define the cycle group and the boundary group, we need to 
introduce the boundary operator. Let us denote the boundary of an /--simplex 
oy by 3,- ay. d r should be understood as an operator acting on a,- to produce its 
boundary. This point of view will be elaborated later. Let us look at the boundaries 
of lower-dimensional simplexes. Since a 0-simplex has no boundary, we define 

3o,Po = 0. (3.18) 

For a 1-simplex (pop\), we define 

di(POPi) = Pi ~ P0- (3.19) 

The reader might wonder about the appearance of a minus sign in front of pq. 
This is again related to the orientation. The following examples will clarify this 
point. In figure 3.7(a), an oriented 1-simplex (pop 2 ) is divided into two, (popi) 
and (pip2)- We agree that the boundary of ( pop2 ) is {/A)} U {p2\ and so should 
be that of (popi) + (p\p 2 )- If 3i (popi) were defined to be po + p 2 , we would 
have di(popi) + d\(p\p 2 ) — po + Pi + Pi + P 2 - This is not desirable since pi 
is a fictitious boundary. If, instead, we take 3| ( poPl) — P 2 — Po, we will have 
di(popi) + di(piP2) = Pi~ Po + P 2 ~ Pi = P2 - Po as expected. The next 
example is the triangle of figure 3.7(b). It is the sum of three oriented 1-simplexes, 



( po p i ) + ( p\p 2 ) + ( P 2 P 0 )• We agree that it has no boundary. If we insisted on 
the rule di(popi) = po + p 1 , we would have 

di(popi) + di(pip 2 ) + d\(p 2 Po) = PO + Pl+ Pl + P2 + P2 + PO 

which contradicts our intuition. If, on the other hand, we take 3 1 ( /?o /? 1 ) = 
p t — po, we have 

dl(popi) + di(pip 2 ) + di(p 2 po) = pi - po + p 2 - pi + po - p 2 = 0 

as expected. Hence, we put a plus sign if the first vertex is omitted and a minus 
sign if the second is omitted. We employ this fact to define the boundary of a 
general /--simplex. 

Let cr r (po . . . p r ) (/- > 0) be an oriented /--simplex. The boundary 3 , oy of 
oy is an (r — l)-chain defined by 

r 

d r o r = y^f-iypopi ... Pi ... p r ) (3.20) 

1=0 

where the point p, under' is omitted. For example, 

d 2 (poP\P 2 ) = (P 1 P 2 ) ~ (POP2) + (popi) 
h(P0PlP2P2) = (P\P 2 Pl) - (P 0 P 2 Pi) + (popipi) - (P 0 PIP 2 )- 

We formally define do&o — 0 for r = 0. 

The operator d r acts linearly on an element c = c, a r , of C r ( K ), 

3 r c = J]c,'3 r a r ,,'. (3.21) 

i 

The RHS of (3.21) is an element of C r -i(K). Accordingly, 3, defines a map 

dr : C r (K) — > Cr-i(K). (3.22) 

3,- is called the boundary operator. It is easy to see that the boundary operator 
is a homomorphism. 

Let K be an //-dimensional simplicial complex. There exists a sequence of 
free Abelian groups and homomorphisms, 

0 -4 C n (K) C n -i(K) 4...4c!(j:)4 Co(K) -4 o (3.23) 

where i : 0 C n (K) is an inclusion map (0 is regarded as the unit element 
of C n (K)). This sequence is called the chain complex associated with K and 
is denoted by C(K). It is interesting to study the image and kernel of the 
homomorphisms d r ■ 



Definition 3.3. If c e C,(K ) satisfies 


d r c = 0 (3.24) 

c is called an r-cycle. The set of /--cycles Z r (K ) is a subgroup of C,(K ) and is 
called the r-cycle group. Note that Z r (K) — ker3,-. [ Remark : If r = 0, 3oc 
vanishes identically and Zq(K) — Cq(K), see (3.23).] 

Definition 3.4. Let K be an //-dimensional simplicial complex and let c e C,(K). 
If there exists an element d e C r + 1 ( K ) such that 

c = d r +\ d (3.25) 

then c is called an r-boundary. The set of /--boundaries B r (K) is a subgroup 
of C r (K ) and is called the r-boundary group. Note that B, (K ) = im3 r+ i. 
[Remark: B n (K ) is defined to be 0.] 

From lemma 3.1, it follows that Z r (K ) and B, (K) are subgroups of C r (K). 
We now prove an important relation between Z r (K ) and B r (K), which is crucial 
in the definition of homology groups. 

Lemma 3.3. The composite map 3,- o d r +i : C r + \ ( K ) -> C r -i(K) is a zero map; 
that is, 3,- (3, _|_i c) = 0 for any c e C r + 1 ( K ). 

Proof. Since 3,- is a linear operator on C r (K), it is sufficient to prove the identity 
3,- o 3,. + i = 0 for the generators of C r + \ ( K ). If r = 0, 3o o 3i =0 since do is a 
zero operator. Let us assume r > 0. Take a — ( po . . . p r p, + \ ) e C r+ \ (K ). We 
find 


r + 1 

3/-(3r+l cr ) = d r ^2(-l) 1 (po ■ ■ ■ Pi ■ ■ ■ Pr+l) 

i=0 

r + 1 

= ^(-l)‘3r(P0 ---Pi ■■■ Pr+ 1) 

i=0 

r+1 , /'—I 

= 1)‘ ( i)- 7 (P0 ...Pj... Pi ■ ■ ■ Pr+l) 

i =0 ^ 7=0 

r+1 \ 

+ (-l) J ~ 1 (P0---Pi---Pj---Pr+l)j 

j=i + 1 J 

= 1),+; ( P {) ■ ■ ■ Pj ■■■Pi--- Pr+l) 

j<i 

- ^(-l) ,+; (po ■ ■ ■ Pi ■ ■ ■ Pj . . ■ Pr+l ) = o (3.26) 



which proves the lemma. 


□ 


Theorem 3.3. Let Z r {K) and B r (K) be the r -cycle group and the /'-boundary 
group of C r (K), then 


Br(K) C Z r (K) (C C r (K)). (3.27) 

Proof. This is obvious from lemma 3.3. Any element c of B, (K) is written as 
c = d r +id f°r some d e C r +\(K). Then we find 3 r c = 3,.(3 r+ i<7) = 0, that is, 
c e Z, (K). This implies Z r (K ) D B r (K). □ 

What are the geometrical pictures of r -cycles and /•-boundaries? With our 
definitions, d r picks up the boundary of an /--chain. If c is an r-cycle, 3 r c — 0 tells 
us that c has no boundary. If c — <) r+ \ d is an r-boundary, c is the boundary of d 
whose dimension is higher than c by one. Our intuition tells us that a boundary 
has no boundary, hence Z, (K ) D B, (K). Those elements of Z r (K) that are not 
boundaries play the central role in this chapter. 

3.3.3 Homology groups 

So far we have defined three groups C r (K), Z r (K) and B, ( K ) associated with 
a simplicial complex K. How are they related to topological properties of K or 
to the topological space whose triangulation is K? Is it possible for C r (K) to 
express any property which is conserved under homeomorphism? We all know 
that the edges of a triangle and those of a square are homeomorphic to each other. 
What about their chain groups? For example, the 1 -chain group associated with a 
triangle is 


C\{Ki) = [i(popi) + jipipi) + k(p2Po)\i, j,ke Z} 

= z® z®z 

while that associated with a square is 

C\(K 2 ) = Z®Z®Z®Z. 

Clearly C\(K\) is not isomorphic to Cj ( Ki), hence C r (K ) cannot be a candidate 
of a topological invariant. The same is true for Z, (K) and B r (K). It turns out 
that the homology groups defined in the following provide the desired topological 
invariants. 

Definition 3.5. Let K be an //-dimensional simplicial complex. The rth 
homology group H r (K ), 0 < r < //, associated with K is defined by 

H r (K) = Z,(K)/B r (K). (3.28) 

[Remarks: If necessary, we define H r (K ) = 0 for r > n or r < 0. If we 
want to stress that the group structure is defined with integer coefficients, we 



write H r (K; Z). We may also define the homology groups with E-coefficients, 
H,(K\ ffi) or those with Z 2 -coefficients, H, (K\ Z 2 ).] 

Since B, (K ) is a subgroup of Z r (K), H r {K ) is well defined. The group 
H, (K ) is the set of equivalence classes of /'-cycles, 

H r (K) = {Ml Z e Z r (K)} (3.29) 

where each equivalence class [z] is called a homology class. Two / -cycles z and 
z' are in the same equivalence class if and only if - — z! e B r (K ), in which case z 
is said to be homologous to z! and denoted by z ~ z! or [z] = [z'{. Geometrically 
z — z! is a boundary of some space. By definition, any boundary b e B r (K ) is 
homologous to 0 since b — Oe B, (K). We accept the following theorem without 
proof. 

Theorem 3.4. Homology groups are topological invariants. Let X be 
homeomorphic to Y and let ( K . f ) and (L,g) be triangulations of X and Y 
respectively. Then we have 


H r (K) = H r (L ) r =0, 1,2, .... (3.30) 

In particular, if ( K , /) and (L, g) are two triangulations of X, then 

H r (K) = H r (L ) r =0, 1,2, .... (3.31) 

Accordingly, it makes sense to talk of homology groups of a topological 

space X which is not necessarily a polyhedron but which is triangulable. For an 
arbitrary triangulation (K, /), H, (X) is defined to be 

H r (X) = H r (K ) r = 0, 1, 2 (3.32) 

Theorem 3.4 tells us that this is independent of the choice of the triangulation 

(K, /)■ 


Example 3.6. Let K = { /?o } - The 0-chain is Cq(K) — {///ok e Z} = Z. Clearly 
Zq{K) — Co(K) and Bq(K ) = {0} (dopo — 0 and po cannot be a boundary of 
anything). Thus 


H 0 (K) = Z 0 (K)/B 0 (K ) = C 0 (K) = Z. (3.33) 

Exercise 3.1. Let K = {po. p\ ) be a simplicial complex consisting of two 0- 
simplexes. Show that 


H r (K) = 


jz ® Z 

({ 0 } 


(r = 0) 
(r # 0). 


(3.34) 



Example 3.7. Let K — {po, pi, (popi)}- We have 

Co(K) - {ipo + jpi\i, j e Z} 

Ci (AT) = {&(/?oPi)|£ e Z}. 

Since (popi) is not a boundary of any simplex in K, B \(K) = {0} and 
H 1 (^) = Z 1 (/f)/Bi(^) = Z 1 (/0. 

If z = m(popi) e Z\{K), it satisfies 

3i z = mdi(popi) = m{p i - pol = mp t - m/?o = 0. 

Thus, m has to vanish and Z i ( AT ) = 0, hence 

H\{K) = 0. (3.35) 

As for Ho(K), we have Zq(K) = Co(K) = {ipo + jp \ } and 

B q (K) = im 3, = {dii(popi)\i e Z} = {i(p 0 - pi)\i e Z}. 

Define a surjective (onto) homomorphism / : Zo(K) — > Z by 

f(ipo + jp\) = i + j- 


Then we find 

ker / = / _1 (0) = Bq(K). 

Theorem 3.1 states that Zo(K) / ker / = im f — Z, or 

H 0 (K) = Zo(K)/B 0 (K) = Z. (3.36) 

Example 3.8. Let K = {p 0 , pi, p 2 , (popi), (pipi), (P 2 Po)}, see figure 3.7 (b). 
This is a triangulation of S 1 . Since there are no 2-simplexes in K, we have 
Bi(K) = 0 and Hi (AT) = Z x (K)/Bi(K) = Z X (K). Let z = i(poPi) + 
j(p\Pl) + k(p 2 po) e Zi(H) where i, j , k e Z. We require that 

9iz = * (Pi - Po) + j (P 2 - Pi) + k(po - Pi) 

= (k - i)po + O' - j)p l + (j - k)p 2 = 0. 

This is satisfied only when i = j — k. Thus, we find that 

Zi(K) = {i{(popi) + (pipi) + (P 2 Po)}\i e Z). 

This shows that Zi(K) is isomorphic to Z and 


Hi(K) = Zi(K) = Z. 


(3.37) 


Let us compute Hq(K). We have Zq(K) = Co(K ) and 

Bq(K) = ldi[l(popi) + m(p\ p 2 ) + n(p 2 pa)]\U m, n e Z} 

= {( n — Z) po + (l — m) p\ + (m — n)p2 | /, m, 77 e Z}. 

Define a surjective homomorphism / : Zq(K) -* Z by 

f(ipo + jPi + kp2) = i + j +k. 

We verify that 

ker / = / _1 (0) = Bq(K). 

From theorem 3.1 we find Zq(K)/ ker / = im / = Z, or 

H 0 (K) = Z 0 (K)/Bq(K) = Z. (3.38) 

K is a triangulation of a circle S 1 , and (3.37) and (3.38) are the homology 
groups of S 1 . 

Exercise 3.2. Let K = {p 0 , pi, pi, P3, (P0Pi), (p\Pl), (P 2 P 3 ), (pipo)} be a 
simplicial complex whose polyhedron is a square. Verify that the homology 
groups are the same as those of example 3.8 above. 

Example 3.9. Let K = {p 0 , pi, p 2 , (popi), (P 1 P 2 ), (.P 2 P 0 ), (POPiPl)}', see 
figure 3.6(b). Since the structure of O-simplexes and 1-simplexes is the same 
as that of example 3.8, we have 


H 0 (K) = Z. (3.39) 

Let us compute Hi(K) = Zi(K) / Bi(K). From the previous example, we 

have 

Z\(K) = [i{(popi) + (pipi) + (p 2 Po)}\i e Z}. 

Let c — m(popip 2 ) e CiiK). If b — 82 c e Bi(K), we have 

b = m{(pip 2 ) - (P 0 P 2 ) + (popi)} 

= m{(p 0 pi) + (P 1 P 2 ) + (p 2 Po)} m e Z. 

This shows that Zi(K) = Bi(K), hence 

H\(K) = Z\(K)/B\(K) = {0}. (3.40) 

Since there are no 3-simplexes in K . we have lh( K) = {0}. Then 
H 2 (K) = Z 2 (K)/B 2 (K) = Z 2 (K). Let z = m(p 0 pip 2 ) e Z 2 (K). Since 
32 z = m{(pip 2 ) - (pop 2 ) + (popi)} = 0 , m must vanish. Hence, Z\(K) = {0} 
and we have 


H 2 (K) = {0}. 


(3.41) 


Exercise 3.3. Let 


K — [po, PI,P2, P3, ( POPl ), (POP2), (POP3)’ (P1P2), (PlP3), (P2P3), 
iPOPlPl), (P0PIP3), (P0P2P3), (P1P2P3)} 

be a simplicial complex whose polyhedron is the surface of a tetrahedron. Verify 

that 

Hq(K) = Z Hi(K)^{0} H 2 (K) = Z. (3.42) 

K is a triangulation of the sphere .S' 2 and (3.42) gives the homology groups of S 2 . 


3.3.4 Computation of Hq (K ) 

Examples 3. 6-3. 9 and exercises 3.2, 3.3 share the same zeroth homology group, 
Ho(K) = Z. What is common to these simplicial complexes? We have the 
following answer. 

Theorem 3.5. Let A" be a connected simplicial complex. Then 

H 0 (K) = Z. (3.43) 


Proof. Since K is connected, for any pair of O-simplexes /?,- and pj , there exists 
a sequence of 1-simplexes ( PiPk ), ( PkPl ), ■ ■ ■ , ( PmPj ) such that 3i ((pipk) + 
(Pk pi) + • • • + ( PmPj )) = Pj ~ Pi- Then it follows that pi is homologous 
to pj, namely [ /;, ] = [pf]. Thus, any 0-simplex in K is homologous to p\ say. 
Suppose 

h 

z = 'Y^ J n iPi e Z 0 (K) 

i= 1 

where Iq is the number of O-simplexes in K. Then the homology class [/] is 
generated by a single point, 


[z] 


J2 n ’ P i 


Ymp* = ^2 n i[p\]- 

i i 


It is clear that [z] = 0, namely z e Bq(K), if n i = 0. 

Let Oj = (jOj, i Pj. 2 ) (1 < j < I\) be 1-simplexes in K , I\ being the number 
of 1-simplexes in K, then 


Bq(K) = im3i 

= {3i(«icti H b npoif) \n\, ..., nj l e Z} 

= {n\(p\,2 - Pl.t) H b npipii, 2 ~ Ph,l)\n\, ..., n/ t e Z}. 

Note that nj (1 < j < 1 1 ) always appears as a pair +nj and —nj in an element 
of Bq(K). Thus, if 

z = J2 n JPj e Bo(K) then ^2, n j = 0- 



Po P2 P3 Pi 



Figure 3.8. A triangulation of the Mobius strip. 


Now we have proved for a connected complex K that z — n ; p, e Bq(K) if 

and only if n, = 0. 

Define a surjective homomorphism / : Zq(K) -> Z by 

Io 

f(nipi H b«/ 0 £>/ 0 ) = X!"'- 

i=l 

We then have ker / = / _1 (0) = Bq(K). It follows from theorem 3.1 that 
Hq(K) = Z 0 (K)/Bo(K ) = Zq(K)/ ker / = im / = Z. □ 

3.3.5 More homology computations 

Example 3.10. This and the next example deal with homology groups of non- 
orientable spaces. Figure 3.8 is a triangulation of the Mobius strip. Clearly 
BnlK) — 0. Let us take a cycle z e Zi(K), 

Z = i{pop\ pi) + jipipiPA) + k(p2P4P3) 

+ 1(P3P4P5) + m(p3P5Pl ) + nipipspo). 


z satisfies 


diz = i{(piP2) - ( POP2 ) + (popi)} 

+ j{(P\P4) - (P2P4) + (P2Pl)} 

+ k{(p4P3) - (P2P3) + (P2P4)} 

+ H(P4P5) ~ ( P3P5 ) + (P3P4)} 

+ m{(p 5 pi) - (p3Pl) + (P3P5)} 

+ nlipspo) - (pipo) + (pips)} = 0 . 


Since each of ( p 0 p 2 ), (pi Pa), (P 2 P 3 ), (P4P5)< (P3Pi) and (pspo) appears once 
and only once in djz, all the coefficients must vanish, i — j — k — l ' — m — n — 




0. Thus, Z 2 (K) = {0} and 


H 2 (K) = Z 2 (K)/B 2 (K) = {0}. (3.44) 

To find H i ( K ), we use our intuition rather than doing tedious computations. 
Let us find the loops which make complete circuits. One such loop is 

Z = (POPl) + (P\P4) + (P4Pi) + (P5Po)- 

Then all the other complete circuits are homologous to multiples of z. For 
example, let us take 

Z = (p\p 2 ) + (p 2 p?) + (P3P?>) + (P3P\)- 
We find that z ~ z! since 

Z-Z' = d 2 {(p 2 p\p4) + (P2P4P?) + (P3P4P?>) + (P\P5PV)}- 
If, however, we take 

z" = ipiPA) + (P4P5) + (P5Po) + (POP2) + (P 2 P?,) + (P3Pl) 

we find that z" ~ 2z since 

2z - z" = 2 (pq/2i) + (pi/?4) + (P4P5) + (P5Po) - (pop?) 

- ( P2P 3) - (P3P\) 

= d2{(P0PlP2) + (PIP4P2) + (P2P4P3) + (P3P4P5) 

+ (P3P3P\) + (PGPIP5)}- 

We easily verify that all the closed circuits are homologous to nz, n e Z. H[(K) 
is generated by just one element [z], 

H\(K) — {i[z]|i eZ) = Z. (3.45) 

Since K is connected, it follows from theorem 3.5 that //q( K ) = \i[p a ]\i € 
Z) = Z, p a being any 0-simplex of K. 


Example 3.11. The projective plane HP 2 has been defined in example 2.5(c) as 
the sphere S 2 whose antipodal points are identified. As a coset space, we may 
take the hemisphere (or the disc D 2 ) whose opposite points on the boundary .S' 1 
are identified, see figure 2.5(b). Figure 3.9 is a triangulation of the projective 
plane. Clearly B 2 (K) = {0}. Take a cycle z e Z 2 (K), 

Z = mi(p 0 pip 2 ) + m 2 (pop4P\) + m 2 (pop 5 p4) 

+ m 4 (poP3P5) + rns(pQp 2 p 3 ) + m b (p 2 p 4 p 2 ,) 

+ m-i(p 2 p 5 p4) + m&(p 2 pi Ps) + m 9 (pip3p 5 ) + wio(PiP4P3)- 


Ps 



Figure 3.9. A triangulation of the projective plane. 


The boundary of z is 

d 2 z = mi{(piP 2 ) - ( POP2 ) + (popi)} 

+ m 2 {(p4Pl) - (popi) + (P0P4)} 

+ m 3 {(p 5 p 4 ) - (pop 4 ) + (P0P5)} 

+ m4{(p3P5 ) - (P0P5) + (POP3)} 

+ W5{(P2P3) - (POP3) + (POP2)} 

+ »*6{(P4P3) - (P2P3) + (P2P4)} 

+ W 7 {(P5P4) - (P2P4) + (P2P5)} 

+ Ws{(piP5) - (P2P5) + (P2Pl)} 

+ W9{(P3P5) - (P1P5) + (P1P3)} 

+ Wio{(P4P3) - (P1P3) + (P1P4)} = 0. 

Let us look at the coefficient of each 1 -simplex. For example, we have (mi — 
m 2 )(poPi), hence m i — m 2 = 0. Similarly, 

— mi + /«5 = 0, m 4 — ms = 0, m 2 — m3 = 0, mi — mg = 0, 
m 9 — mio = 0, —m2 + mio = 0, ms — mg = 0, mj — m 7 = 0, 
m 5 + mio = 0. 


These ten conditions are satisfied if and only if m; =0, 1 < i < 10. This means 
that the cycle group Z 2 (K) is trivial and we have 

H 2 (K) = Z 2 (K)/B 2 (K) = {0}. (3.46) 

Before we calculate H\(K), we examine H 2 (K) from a slightly different 
viewpoint. Let us add all the 2-simplexes in K with the same coefficient, 

to 

z = / ^ ma 2 j 77i e Z. 

!=1 

Observe that each 1-simplex of K is a common face of exactly two 2-simplexes. 
As a consequence, the boundary of - is 

d 2 z = 2m(p 2 p 5 ) + 2m(p 5 p4) + 2m(p4p 2 ). (3.47) 

Thus, if z e Z 2 (K), m must vanish and we find Z 2 (K) = {0} as before. This 
observation remarkably simplifies the computation of H\(K). Note that any 1- 
cycle is homologous to a multiple of 

2 = (PlPi) + (P5P4) + (P4P3) 

cf example 3.10. Furthermore, equation (3.47) shows that an even multiple of z is 
a boundary of a 2-chain. Thus, z is a cycle and z + z is homologous to 0. Hence, 
we find that 

Hi(K) = [[z]\[z] + [z] ~ [0]} = Z 2 . (3.48) 

This example shows that a homology group is not necessarily free Abelian but 
may have the full structure of a finitely generated Abelian group. Since K is 
connected, we have Hq(K ) = Z. 

It is interesting to compare example 3.11 with the following examples. 
In these examples, we shall use the intuition developed in this section on 
boundaries and cycles to obtain results rather than giving straightforward but 
tedious computations. 

Example 3.12. Let us consider the torus T 2 . A formal derivation of the homology 
groups of T 2 is left as an exercise to the reader: see Fraleigh (1976), for example. 
This is an appropriate place to recall the intuitive meaning of the homology 
groups. The rth homology group is generated by those boundaryless /--chains 
that are not, by themselves, boundaries of some (/- + l)-chains. For example, 
the surface of the torus has no boundary but it is not a boundary of some 3- 
chain. Thus, H 2 (T 2 ) is freely generated by one generator, the surface itself, 
H 2 (T 2 ) = Z. Let us look at H\(T 2 ) next. Clearly the loops a and b in figure 3.10 
have no boundaries but are not boundaries of some 2-chains. Take another loop 
a', a' is homologous to a since a 1 — a bounds the shaded area of figure 3.10. 



Figure 3.10. a ' is homologous to a but b is not. a and b generate H\(T 2 ). 




Figure 3.11. a, and /?, (!</< g) generate Hi (Eg). 


Hence, H\(T 2 ) is freely generated by a and b and H\(T 2 ) = Z ® Z. Since T 2 is 
connected, we have Hq(T 2 ) = Z. 

Now it is easy to extend our analysis to the torus E g of genus g. Since E g has 
no boundary and there are no 3-simplexes, the surface E g itself freely generates 
H 2 (T 2 ) = Z. The first homology group ll\ (Eg) is generated by those loops 
which are not boundaries of some area. Figure 3.11 shows the standard choice for 
the generators. We find 

Hi(E ? ) = {f i [«i] + j\[b\] H h i g [ag\ + j g [b g ]} 

— Z ® Z © • • • © Z . (3.49) 

' 

2 g 

Since E g is connected, HoCE g ) = Z. Observe that (/;,) is homologous to the 
edge aj(bj) of figure 2.12. The 2 g curves {a,-, /?,} are called the canonical system 
of curves on E ? . 

Example 3.13. Figure 3.12 is a triangulation of the Klein bottle. Computations of 
the homology groups are much the same as those of the projective plane. Since 
B 2 (K) = 0, we have H 2 (K) = Z 2 (K). Let z € Z 2 (K). If z is a combination 
of all the 2-simplexes of K with the same coefficient, z — ^mcr 2 j, the inner 
1-simplexes cancel out to leave only the outer 1-simplexes 


d 2 z = —2 ma 



Po 


p 2 a p. 


Po 


Figure 3.12. A triangulation of the Klein bottle. 


where a — (popi) + (pi pi) + (P 2 Po)- For d 2 z to be 0, the integer in must vanish 
and we have 


H 2 (K) = Z 2 (K ) = {0}. (3.50) 


To compute H\{K ) we first note, from our experience with the torus, that 
every 1 -cycle is homologous to ia + jb for some i, j e Z. For a 2-chain to have 
a boundary consisting of a and b only, all the 2-simplexes in K must be added 
with the same coefficient. As a result, for such a 2-chain z — mcT 2,i, we have 
dz = 2m a. This shows that 2m a ~ 0. Thus, H\(K ) is generated by two cycles a 
and b such that a + a — 0, namely 


H\{K) — {i[a] + j[b]\i, j e Z} = Z 2 (B Z. (3.51) 


We obtain Hq(K ) = Z since K is connected. 



3.4 General properties of homology groups 
3.4.1 Connectedness and homology groups 

Let K — { pa) and L — { po, p\ \. From example 3.6 and exercise 3.1, we have 
H q (K) = Z and Hq{L) — Z ©Z. More generally, we have the following theorem. 

Theorem 3.6. Let L be a disjoint union of N connected components, K — 
K\ U K 2 U • • • U Kn where K( fl Kj — 0. Then 

H r (K) = H r (Ki) © H r (K 2 ) 0 • • • 0 H r (K N ). (3.52) 

Proof. We first note that an r -chain group is consistently separated into a direct 
sum of N r -chain subgroups. Let 

C r (K) — y^CjOrj Cj e Z \ 

1=1 ' 

where I, is the number of linearly independent r-simplexes in K. It is always 
possible to rearrange 07 so that those r-simplexes in K \ come first, those in K 2 
next and so on. Then C r (K ) is separated into a direct sum of subgroups, 

Cr(K) = Cr(Ki ) © Cr(K 2 ) © • ■ ■ © C r (K N ). 

This separation is also carried out for Z r (K) and B, (K) as 

Z r (K) = Z r {K\) © Z t (K 2 ) © • • • © Z r (K N ) 

B r (K ) = B r (K\) © B,.(K 2 ) © • • • © B r (K N ). 

We now define the homology groups of each component K, by 

H r (K t ) - Z r (Ki)/B r (Ki). 

This is well defined since Z r (Kj) D B, (Kj). Finally, we have 
H r (K) = Z r (K)/B r (K ) 

= Z, (K\) © • • • © Z r ( K n) / B r ( K \) © • • • © B, (Kn) 

= [Z r {K\)/ B r {K\)} © • • • © {Z r (K N )/B r (K N )} 

= H r (Ki)® •••©//, (K n ). □ 

Corollary 3.1. (a) Let If be a disjoint union of N connected components, 
K\, . . . , Kff. Then it follows that 

Hq(K) = Z©---©Z . (3.53) 

N factors 

(b) If Hq(K) = Z, K is connected. [Together with theorem 3.5 we conclude 
that Hq(K) = Z if and only if K is connected.] 



3.4.2 Structure of homology groups 


Z r (K) and B, (K) are free Abelian groups since they are subgroups of a free 
Abelian group C,(K). It does not mean that H r (K ) = Z r (K)/ B r (K) is also free 
Abelian. In fact, according to theorem 3.2, the most general form of H, (K ) is 

H r (K) = Z 0 • • ■ ® Z eZt, 0 • • • 0 Z fcg . (3.54) 

/ 

It is clear from our experience that the number of generators of H, (K) counts 
the number of ( r + 1) -dimensional holes in |K|. The first / factors form a free 
Abelian group of rank / and the next p factors are called the torsion subgroup 
of H r (K). For example, the projective plane has H\(K) = 7 2 and the Klein 
bottle has H\(K ) = Z © Z 2 . In a sense, the torsion subgroup detects the 
‘twisting’ in the polyhedron |K|. We now clarify why the homology groups with 
Z -coefficients are preferable to those with Z 2 - or M-coefficients. Since Z 2 has no 
non-trivial subgroups, the torsion subgroup can never be recognized. Similarly, 
if IR-coefficients are employed, we cannot see the torsion subgroup either, since 
IR/mIR = {0} for any m e Z — {0}. [For any a, b e K, there exists a number 
cel such that a — b — me.] If H r (K; Z) is given by (3.54), H r (K; M) is 

H r (K; M) = M ® ® • • • (B M . (3.55) 

^ v 

f 

3.4.3 Betti numbers and the Euler-Poincare theorem 

Definition 3.6. Let K be a simplicial complex. The rth Betti number b r (K ) is 
defined by 

b r (K) = dim H r (K; IK). (3.56) 

In other words, b r (K) is the rank of the free Abelian part of H r (K\ Z). 

For example, the Betti numbers of the torus T 2 are (see example 3.12) 
b 0 (K)=l, bi(K) = 2, b 2 {K)= 1 

and those of the sphere S 2 are (exercise 3.3) 

b 0 (K)= 1, h(K) = 0, b 2 (K)= 1. 

The following theorem relates the Euler characteristic to the Betti numbers. 

Theorem 3.7. (The Euler-Poincare theorem) Let K be an n -dimensional 
simplicial complex and let /,■ be the number of r-simplexes in K. Then 

n n 

X(K) = J](-l)'7,- = J](-1)'MK). (3.57) 

r=0 r=0 

[Remark: The first equality defines the Euler characteristic of a general 
polyhedron |K|. Note that this is the generalization of the Euler characteristic 
defined for surfaces in section 2.4.] 



Proof. Consider the boundary homomorphism, 

a,. : C r {K\ IR) -> W) 

where C-i(K\ M) is defined to be {0} . Since both C, _ i (K ; TO) and C r (K; IR) are 
vector spaces, theorem 2. 1 can be applied to yield 

/,• = dm\C r (K\ IR) = dim(ker9, ) + dim(im d r ) 

— dim Z r (K; IR) + dim B r -\ (K ; IR) 

where B-i(K) is defined to be trivial. We also have 

b r (K) = dim H r {K\ TO) = dim(Z,.(^; W)/B r {K\ IR)) 

= dim Z r (K; IR) — dim 5, IR). 

From these relations, we obtain 

n n 

X (K) = £](-l)'7, = ^(— l)' (dim Z r (K\ IR) + dim B,- { (K; 1)) 

r = 0 r=0 

n 

= ^{(-i) r dim Z r (K\ IR) - (-If dimB r (^T; IR)} 

r= 0 
n 

= J](-i YhAR). □ 

r= 0 

Since the Betti numbers are topological invariants, x(K) is also conserved 
under a homeomorphism. In particular, if / : K -* X and g : \ K ' \ — >■ X are 
two triangulations of X , we have y (K ) = x(K'). Thus, it makes sense to define 
the Euler characteristic of X by x (K ) for any triangulation (K, f) of X . 



Figure 3.13. A hole in S 2 . whose edges are identified as shown. We may consider S 2 with 
q such holes. 



Problems 


3.1 The most general orientable two-dimensional surface is a 2-sphere with h 
handles and q holes. Compute the homology groups and the Euler characteristic 
of this surface. 

3.2 Consider a sphere with a hole and identify the edges of the hole as shown in 
figure 3.13. The surface we obtained was simply the projective plane RP 2 . More 
generally, consider a sphere with q such 'crosscaps’ and compute the homology 
groups and the Euler characteristic of this surface. 
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HOMOTOPY GROUPS 


The idea of homology groups in the previous chapter was to assign a group 
structure to cycles that are not boundaries. In homotopy groups, however, we 
are interested in continuous deformation of maps one to another. Let X and Y 
be topological spaces and let S' be the set of continuous maps, from X to Y. We 
introduce an equivalence relation, called ‘homotopic to’ , in S by which two maps 
/, g e S are identified if the image f(X) is continuously deformed to g(X) in 
Y . We choose X to be some standard topological spaces whose structures are 
well known. For example, we may take the n- sphere S" as the standard space and 
study all the maps from S n to Y to see how these maps are classified according to 
homotopic equivalence. This is the basic idea of homotopy groups. 

We will restrict ourselves to an elementary study of homotopy groups, which 
is sufficient for the later discussion. Nash and Sen (1983) and Croom (1978) 
complement this chapter. 


4.1 Fundamental groups 

4.1.1 Basic ideas 

Let us look at figure 4.1. One disc has a hole in it, the other does not. What 
characterizes the difference between these two discs? We note that any loop in 
figure 4.1(£>) can be continuously shrunk to a point. In contrast, the loop a in 
figure 4.1 (a) cannot be shrunk to a point due to the existence of a hole in it. Some 
loops in figure 4.1(a) may be shrunk to a point while others cannot. We say a loop 
a is homotopic to p if a can be obtained from /I by a continuous deformation. For 
example, any loop in Y is homotopic to a point. It turns out that ‘homotopic to’ 
is an equivalence relation, the equivalence class of which is called the homotopy 
class. In figure 4.1, there is only one homotopy class associated with Y . In X, 
each homotopy class is characterized by n eZ, n being the number of times the 
loop encircles the hole; n < 0 if it winds clockwise, n > 0 if counterclockwise, 
n = 0 if the loop does not wind round the hole. Moreover, Z is an additive group 
and the group operation (addition) has a geometrical meaning; n + m corresponds 
to going round the hole first n times and then m times. The set of homotopy 
classes is endowed with a group structure called the fundamental group. 



Figure 4.1. A disc with a hole (a) and without 
a from shrinking to a point. 



hole (b). The hole in (a) prevents the loop 


4.1.2 Paths and loops 

Definition 4.1. Let X be a topological space and let / = [0, 1]. A continuous 
map a \ I —*■ X is called a path with an initial point xo and an end point xi if 
a(0) = xo and a(l) = x\. If or(0) = a(l) = xo, the path is called a loop with 
base point xo (or a loop at xo). 

For x e X, a constant path c x : / — *■ X is defined by Cjc(i) — x, s e I . A 
constant path is also a constant loop since c x (0) = c x ( 1 ) = x. The set of paths 
or loops in a topological space X may be endowed with an algebraic structure as 
follows. 


Definition 4.2. Let a, /J : I — >■ X be paths such that a {\ ) = /AO). The product of 
or and /3, denoted by a * ft, is a path in X defined by 


a * /3(s) = 


| a(2 s) 


0<s<2 

j<s<l 


(4.1) 


see figure 4.2. Since a(l) = yS (0), a * ft is a continuous map from 1 to X. 
[Geometrically, a. * (5 corresponds to traversing the image a (I), in the first half, 
then followed by fi{I) in the remaining half. Note that the velocity is doubled.] 


Definition 4.3. Let a : I — »■ X be a path from xo to xi . The inverse path a 1 of a 
is defined by 

a -1 ( 5 ) = a(l — s) sel. (4.2) 

[The inverse path or -1 corresponds to traversing the image of a in the opposite 
direction from x\ toxo-] 

Since a loop is a special path for which the initial point and end point agree, 
the product of loops and the inverse of a loop are defined in exactly the same way. 


I 


1 



Figure 4.2. The product a * /3 of paths a and with a common end point. 


It seems that a constant map c x is the unit element. However, it is not: a * a~ ] 
is not equal to c x ! We need a concept of homotopy to define a group operation in 
the space of loops. 


4.1.3 Homotopy 

The algebraic structure of loops introduced earlier is not so useful as it is. For 
example, the constant path is not exactly the unit element. We want to classify the 
paths and loops according to a neat equivalence relation so that the equivalence 
classes admit a group structure. It turns out that if we identify paths or loops that 
can be deformed continuously one into another, the equivalence classes form a 
group. Since we are primarily interested in loops, most definitions and theorems 
are given for loops. However, it should be kept in mind that many statements are 
also applied to paths with proper modifications. 


Definition 4.4. Let a, : I —*■ X be loops at xq . They are said to be homotopic, 
written as a ~ /3, if there exists a continuous map F : I x / X such that 


F(s, 0) = cr(s), F(s, 1) = /3(s) Vs e I 
F(0,t) = F(l,t) = x 0 Vr e /. 


(4.3) 


The connecting map F is called a homotopy between a and fi. 


It is helpful to represent a homotopy as figure 4.3(a). The vertical edges of 
the square / x / are mapped to xq. The lower edge is a(s) while the upper edge 
is fi(s). In the space X. the image is continuously deformed as in figure 4.3 (b). 


Proposition 4.1. The relation a ~ is an equivalence relation. 



Figure 4.3. (a) The square represents a homotopy F interpolating the loops a and fi. ( b ) 
The image of a is continuously deformed to the image of fi in real space X. 


t 
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G 


fi 

F 


a 


s 

Figure 4.4. A homotopy H between a and y via fi. 


Proof. Reflectivity, a ~ a. The homotopy may be given by F(s. t) — a (s ) for 
any tel. 

Symmetry. Let a ~ f with the homotopy F(s, t) such that F (s , 0) = a(s), 
F(s, 1) = f(s). Then ft ~ a, where the homotopy is given by F(s , 1 — t). 

Transitivity. Let a ~ f and fi ~ y. Then a ~ y. If F(s, t) is a homotopy 
between a and fi and G(s,t ) is a homotopy between fi and y, a homotopy 
between a and y may be (figure 4.4) 

| F(s, 2 1) 0 < t < j 

I G(s, 2t — 1) \<t< 1. 


H(s,t) = 


□ 





4.1.4 Fundamental groups 

The equivalence class of loops is denoted by [a] and is called the homotopy 
class of a. The product between loops naturally defines the product in the set of 
homotopy classes of loops. 


Definition 4.5. Let X be a topological space. The set of homotopy classes of loops 
at xo e X is denoted by n\ (X . xq) and is called the fundamental group (or the 
first homotopy group) of X at xq . The product of homotopy classes [a] and [ft\ 
is defined by 

[a] * IP] = [a* PI (4.4) 


Lemma 4.1. The product of homotopy classes is independent of the representa- 
tive, that is, if a ~ a 1 and fl ~ ft, then a * fl ~ a' * ft . 


Proof. Let F(s, t) be a homotopy between a and a' and G (,v , t) be a homotopy 
between fl and ft . Then 


H(s, t) 


) F(2s, t) 

I G(2s - l,f) 


°<j < i 

3 - 1 


is a homotopy between a * fl and a’ * ft, hence a * fl ~ cf * ft and [a] * [fl] is 
well defined. □ 


Theorem 4.1. The fundamental group is a group. Namely, if a, ft . . . are loops at 
x e X, the following group properties are satisfied: 

a) (M * m * m = [«] * m * M) 

(2) [a] * [c x ] = [a] and [c*] * [a] = [a] (unit element) 

(3) [a] * [a -1 ] = [c*], hence [a] -1 = [a -1 ] (inverse). 


Proof. (1) Let F(s, t ) be a homotopy between (a * fl) * y and a * (fl * y). It may 
be given by (figure 4.5(a)) 


F(s, t) = 


As 

1 + t 
P(4 s -t- 1) 
(As-t-2 

y \^T- 


0 < s < 


1 + 1 


1 + t 2 + t 

< s < 

4 _ _ 4 

2 + t 

< s < 1. 


Thus, we may simply write [a * fl * y] to denote [(a * fl) * y] or [a * (fl * /)]. 


(a) (b) 

tv /> y a 




S 


s 


Figure 4.5. (a) A homotopy between (a * f)) * y and a *(/3*y). (b) A homotopy between 
a * c x and a. 


(2) Define a homotopy F(s, t ) by (figure 4.5(b)) 


F(s, t) = 


2s 

1 + t 


0 < s < 


t + 1 


t + 1 


< 5 < 1 . 


Clearly this is a homotopy between a * c x and a. Similarly, a homotopy between 
c x * a and a is given by 


F(s, t) = 


0 < 5 < 


1 - t 


2s — 1 + t \ 1 — t 

1 + t 


< s < 1. 


This shows that [a] * [c v ] = [a] = [c*] * [a]. 
(3 ) Define a map F : / x / — > X by 


F(s, t) = 


a(2s(l - 0) 


0 < s < 


I or(2(l — s)(l — t)) i<i<l. 


Clearly F(s. 0) = a * a -1 and F(s, 1) = c x , hence 

[a * a -1 ] = [a] * [a -1 ] = [c*]. 

This shows that [a -1 ] = 


□ 


In summary, n\(X, x) is a group whose unit element is the homotopy class 
of the constant loop c x . The product [a] * [/l] is well defined and satisfies the 





Figure 4.6. From a loop a at XQ, a loop 1 * a * rj at x\ is constructed. 


group axioms. The inverse of [a] is [ct ] -1 = [a -1 ]. In the next section we 
study the general properties of fundamental groups, which simplify the actual 
computations. 

4.2 General properties of fundamental groups 

4.2.1 Arcwise connectedness and fundamental groups 

In section 2.3 we defined a topological space X to be arcwise connected if, for 
any xo, x\ e X , there exists a path a such that ct( 0 ) = xo and a{ 1 ) = x\. 

Theorem 4.2. Let X be an arcwise connected topological space and let xo, xi e 
X. Then iti(X, xo) is isomorphic to 7ti(X, x\). 

Proof. Let ?;:/—> X be a path such that >;(0) = xo and 77 ( 1 ) = xi. 
If a is a loop at xo, then rj~ l * a * ;/ is a loop at xi (figure 4.6). Given an 
element [a] e tt\(X , xo), this correspondence induces a unique element [a'] = 
[rj~ l * a * rj] e n\{X, x\). We denote this map by P, ; : tt\{X, xo) —> tt\(X, xi) 
so that [a'] = P, ; ([a]). 

We show that P rj is an isomorphism. First, P rj is a homomorphism, since for 
[a], [/l] e n i(Z, xo), we have 

A ; ([«] * [yS]) = * [«] * [yS] * [77] 

= bi~ l ] * [a] * [>?] * t'T 1 ] * [f] * M 
= Pn ([<*]) * Pri (.[ft])- 

To show that P t] is bijective, we introduce the inverse of P rj . Define a map 
Pf l : Tt\{X,x\) — >■ tti(X,xo) whose action on [a'] is P ) j“ 1 ([a / ]) = 



Clearly P 1 is the inverse of P, t since 


P n 1 ° Pr,([a]) = P tl ' ([>] 1 * a * r]]) — [r] * r] 1 * a * r, * t) X ] = [a]. 

Thus, P " 1 o P r] = idjrj ( jsc,jc 0 ) • From the symmetry, we have P r , o P~ x = id„ l( x, Al ). 
We find from exercise 2.3 that P, ; is one to one and onto. □ 

Accordingly, if X is arcwise connected, we do not need to specify the base 
point since rt\ (X, * 0 ) = n\ ( X , x\ ) for any * 0 , xi e X, and we may simply write 

7Tl(X). 

Exercise 4.1. (1) Let i] and f be paths from vo to xi, such that ~ £. Show that 

Pr, = P<- 

(2) Let 1 1 and f be paths such that 77 (1) = ^(0). Show that P = P^ o P () . 

4.2.2 Homotopic invariance of fundamental groups 

The homotopic equivalence of paths and loops is easily generalized to arbitrary 
maps. Let /, g : X -> Y be continuous maps. If there exists a continuous map 
F : X x I — > Y such that P(x, 0) = f{x) and F(x . 1) = g(x), f is said to be 
homotopic to g, denoted by / ~ g. The map F is called a homotopy between / 
and g. 

Definition 4.6. Let X and Y be topological spaces. X and Y are of the same 
homotopy type, written as X ~ Y, if there exist continuous maps / : X -> Y 
and g : Y — > X such that / o g ~ idy and g o f ~ idx- The map / is 
called the homotopy equivalence and g, its homotopy inverse. [Remark: If X is 
homeomorphic to Y , X and Y are of the same homotopy type but the converse is 
not necessarily true. For example, a point {/?} and the real line M are of the same 
homotopy type but {/?} is not homeomorphic to IK.] 

Proposition 4.2. ‘Of the same homotopy type’ is an equivalence relation in the 
set of topological spaces. 

Proof. Reflectivity: X ~ X where idx is a homotopy equivalence. Symmetry: 
Let X ~ Y with the homotopy equivalence f : X Y. Then Y — X, the 
homotopy equivalence being the homotopy inverse of /. Transitivity: Let X ~ Y 
and Y ~ Z. Suppose f : X — > Y, g : Y — > Z are homotopy equivalences and 
f , :Y—>X,g , :Z—>Y, their homotopy inverses. Then 

(g 0 /)(/' 0 g') = gif ° f)g ’ ~ g o idy o g' = g o g' ~ id z 
if o g'Kg o /) = f{g' o g)f ~ /' o idy o / = /' o / ~ id x 


from which it follows X ~ Z. 


□ 



Figure 4.7. The circle R is a retract of the annulus X. The arrows depict the action of the 
retraction. 


One of the most remarkable properties of the fundamental groups is that two 
topological spaces of the same homotopy type have the same fundamental group. 

Theorem 4.3. Let X and Y be topological spaces of the same homotopy type. If 
/ : X -> Y is a homotopy equivalence, tti(X, xo) is isomorphic to tt\{Y, fix o)). 

The following corollary follows directly from theorem 4.3. 

Corollary 4.1. A fundamental group is invariant under homeomorphisms, and 
hence is a topological invariant. 

In this sense, we must admit that fundamental groups classify topological 
spaces in a less strict manner than homeomorphisms. What we claim at most is 
that if topological spaces X and Y have different fundamental groups, X cannot 
be homeomorphic to Y . Note, however, that the homotopy groups including the 
fundamental groups have many applications to physics as we shall see in due 
course. We should stress that the main usage of the homotopy groups in physics 
is not to classify spaces but to classify maps or field configurations. 

It is rather difficult to appreciate what is meant by ‘of the same homotopy 
type’ for an arbitrary pair of X and Y . In practice, however, it often happens that 
Y is a subspace of X . We then claim that X ~ Y if Y is obtained by a continuous 
deformation of X. 

Definition 4.7. Let R (^0) be a subspace of X. If there exists a continuous map 
/ : X — > R such that / \r = id/,', R is called a retract of X and / a retraction. 

Note that the whole of X is mapped onto R keeping points in R fixed. 
Figure 4.7 is an example of a retract and retraction. 


Figure 4.8. The circle R is not a deformation retract of A. 


Definition 4.8. Let R be a subspace of X. If there exists a continuous map 
H:Xx/^X such that 

H{x,0) — .x H(x,l)eR for any xe A (4.5) 

H(x, t) — x for any x e R and any tel. (4.6) 

The space R is said to be a deformation retract of X. Note that H is a homotopy 
between idx and a retraction / : A — > R, which leaves all the points in R fixed 
during deformation. 

A retract is not necessarily a deformation retract. In figure 4.8, the circle R 
is a retract of X but not a deformation retract, since the hole in X is an obstruction 
to continuous deformation of idx to the retraction. Since X and R are of the same 
homotopy type, we have 

7ri(Z, a) = tt\(R, a) a e R. (4.7) 

Example 4.1. Let X be the unit circle and Y be the annulus, 

X = {e i0 |O < 6 < 2tt] 

Y = {re i0 10 < 6 < 2n, \ < r < §} 

see figure 4.7. Define / : X Y by /(e 10 ) = e 10 and g 
g(re ld ) = e 10 . Then fog: re 10 m- e 10 and g o f : e 10 m* e 10 
fog ~ idy and g o f = idx- There exists a homotopy 

H(re ie , t) = {1 + (r - 1)(1 - r)}e i0 

which interpolates between idx and / o g. keeping the points on X fixed. 
Hence, A is a deformation retract of Y. As for the fundamental groups we have 
7T i (X, a) = 7Ti (T, a) where a e X. 


(4.8) 

(4.9) 

: Y A by 
. Observe that 


Definition 4.9. If a point a e X is a deformation retract of X, X is said to be 

contractible. 

Let c a : X {a} be a constant map. If X is contractible, there exists a 
homotopy H : X x / — > X such that H(x, 0) = c a (x) — a and H (x , 1 ) = 
idx(x) = x for any x e X and, moreover, H(a,t ) = a for any tel. The 
homotopy H is called the contraction. 

Example 4.2. X = K" is contractible to the origin 0. In fact, if we define 
H : W x / — » K by H(x, t ) — tx, we have (i) H(x, 0) = 0 and H (x, 1) = x 
for any x e X and (ii) H{ 0, 1) = 0 for any tel. Now it is clear that any convex 
subset of M" is contractible. 

Exercise 4.2. Let D 2 = {(x,y) e IR 2 |x 2 + y 2 < 1}. Show that the unit circle 
S' 1 is a deformation retract of D 2 — {0}. Show also that the unit sphere S n is a 
deformation retract of D n+l — {0}, where D" +1 = {x e K" +1 ||x| < 1}. 

Theorem 4.4. The fundamental group of a contractible space X is trivial, 
ni(X,xo) = {e}. In particular, the fundamental group of K" is trivial, 
JTi (E”,x 0 ) = {e}. 

Proof. A contractible space has the same fundamental group as a point { p\ and a 
point has a trivial fundamental group. □ 

If an arcwise connected space X has a trivial fundamental group, X is said 
to be simply connected, see section 2.3. 

4.3 Examples of fundamental groups 

There does not exist a routine procedure to compute the fundamental groups, 
in general. However, in certain cases, they are obtained by relatively simple 
considerations. Here we look at the fundamental groups of the circle S l and 
related spaces. 

Let us express 5 1 as {z e <C||z| = 1}. Define a map p : IK — > S 1 by 
p : x i->- exp(ix). Under p. the point 0 e M is mapped to 1 e .S' 1 , which is 
taken to be the base point. We imagine that 11 wraps around .S' 1 under p, see 
figure 4.9. If x, y e 1 satisfies x — y = 2jtm(m e Z), they are mapped to the 
same point in S l . Then we write x ~ y. This is an equivalence relation and the 
equivalence class [x] = {y|x — y = 2 ttot for some m e Z} is identified with 
a point exp(ix) e S 1 . It then follows that S l = ffi/2^Z. Let f : M — >■ M be 
a continuous map such that /( 0) = 0 and f(x + 2 n) ~ /(x). It is obvious 
that fix + 2n) = f(x) + 2mt for any x e ffi, where n is a fixed integer. If 
x ~ y (x — y = 2nm), we have 

f(x) - f{y) = fly + 2tt m) - fly) 

— fiy) + 2.jtmn — fly) = 2jtmn 


T(x + 2n) 





4rr 


2n 


0 = 7(0) 


1 = f( 1) 


Figure 4.9. The map p : R -*■ S 1 defined by x \-x exp(ix) projects x + 2m n to the same 
point on S 1 , while / : R — » R. such that /( 0) =0 and / (x + 2?r) = / (x) + 2?!7r for 
fixed n, defines a map / : 5 1 — > 5 1 . The integer n specifies the homotopy class to which 
/ belongs. 


hence f(x) ~ /(y). Accordingly, / : M — >• M uniquely defines a continuous 
map / : 1R/27TZ -> M/27rZ by /([x]) — p o f(x), see figure 4.9. Note that / 
keeps the base point 1 e 5 1 fixed. Conversely, given a map / : .S' 1 — > .S 1 , which 
leaves 1 € 5 1 fixed, we may define a map / : M — >• M such that /( 0) = 0 and 
f(x + 2n) = f (x) + 2itn. 

In summary, there is a one-to-one correspondence between the set of maps 
from S 1 to S 1 with /( 1) = 1 and the set of maps from ffi to M such that / (0) = 0 
and fix + 2tt) = fix) + 2nn. The integer n is called the degree of / and is 
denoted by deg(/). While x encircles .S' 1 once, fix) encircles .S' 1 n times. 

Lemma 4.2. (1) Let f,g : 5 1 — > 5 1 such that /( 1) = g(l) = 1. Then 
deg if) — deg(g) if and only if / is homotopic to g. 

(2) For any n e Z, there exists a map / : 5 1 -> 5 1 such that deg(/) = n. 



Proof. (1) Let deg(/) = deg(g) and /, g : M -> M be the corresponding maps. 
Then F(x , t) = t f(x) + (1 — t)g(x) is a homotopy between f(x) and g(x). It 
is easy to verify that F = p o F is a homotopy between / and g. Conversely, 
if / ~ 8 : S l •^ l > there exists a homotopy F : .S' 1 x / — > .V 1 such that 
F(l.t) — I for any t e 1 . The corresponding homotopy F : I x / -> I 
between / and g satisfies F(x + 2ir , t) — F(x, t ) + 2 nn for some n e Z. Thus, 
deg (/) =_deg(g). 

(2) / : x i-^- nx induces a map / : 5 1 — > S 1 with deg (/) — n. □ 

Lemma 4.2 tells us that by assigning an integer deg(/) to a map / : S l —> 5 1 
such that /( 1) = 1, there is a bijection between tt\ (.S' 1 . 1) and Z. Moreover, this 
is an isomorphism. In fact, for /, g : S l — »■ 5 1 , / * g, defined as a product of 
loops, satisfies deg(/ * g) — deg (/) + deg(g). [Let / (x + 2n) — f(x) + 2: xn 
and g{x + 2n) — g(x) + 2jrm. Then / * g(x + 2i r) — f * g(x) + 2 n(m + n). 
Note that * is not a composite of maps but a product of paths.] We have finally 
proved the following theorem. 

Theorem 4.5. The fundamental group of S 1 is isomorphic to Z, 

TTi C S l ) = Z. (4.10) 

[Since .S' 1 is arcwise connected, we may drop the base point.] 

Although the proof of the theorem is not too obvious, the statement itself is 
easily understood even by children. Suppose we encircle a cylinder with an elastic 
band. If it encircles the cylinder n times, the configuration cannot be continuously 
deformed into that with m (fn) encirclements. If an elastic band encircles a 
cylinder first n times and then m times, it encircles the cylinder n + m times in 
total. 

4.3.1 Fundamental group of torus 

Theorem 4.6. Let X and Y be arcwise connected topological spaces. Then 
7i 1 (X x Y, (jco, >’o)) is isomorphic to it\ ( X , xo) © 7t\ (Y, yo). 

Proof. Define projections p\ : X x Y — »• X and p 2 : X x Y — > Y. If a is a 
loop in X x Y at (xo, y o), «i = piipi) is a loop in X at xo, and 0 L 2 = pi(oi) 
is a loop in Y at yo- Conversely, any pair of loops a\ of X at xo and 012 of Y 
at yo determines a unique loop a = (ot\, 012 ) of X x Y at (xo. yo)- Define a 
homomorphism : 7ri(X x Y, (xo, yo)) — * 7ti(X, xo) © 7ri(T, yo) by 

<K[a]) = ([at], [« 2 ])- 


By construction q> has an inverse, hence it is the required isomorphism and 
7T1 (. X X Y, (xo, yo)) = 7T 1 (X, xo) © TTi (T, yo). □ 



(4.11) 


Example 4.3. (1) Let T 2 — S l x .S’ 1 be a torus. Then 

© 7Ti (S 1 ) = Z © Z. 

Similarly, for the n -dimensional torus 

T n = S 1 x S l x • • • x S l 

^ Y 

n 


we have 


7Ti(T' l ) = Z®Z®---®Z. 

~~ v ^ 

n 


(2) Let A = S 1 x ffi be a cylinder. Since tti(IR) = {e}, we have 


(4.12) 


7ri(Z) = Z © {e} = Z. 


(4.13) 


4.4 Fundamental groups of polyhedra 

The computation of fundamental groups in the previous section was, in a sense, ad 
hoc and we certainly need a more systematic way of computing the fundamental 
groups. Fortunately if a space X is triangulable, we can compute the fundamental 
group of the polyhedron K, and hence that of X by a routine procedure. Let us 
start with some aspects of group theories. 

4.4.1 Free groups and relations 

The free groups that we define here are not necessarily Abelian and we employ 
multiplicative notation for the group operation. A subset X — {xj } of a group G 
is called a free set of generators of G if any element g e G — {e} is uniquely 
written as 

g = x\' x l 2 ■■■ x l £ (4.14) 

where n is finite and ik e Z. We assume no adjacent Xj are equal; Xj ^ Xj+ 1 . 
If ij — 1, xj 1 is simply written as Xj . If i j = 0, the term Xj° should be dropped 
from g. For example, g — a^b^cb 2 is acceptable but h = a 2 a~ 2 cb° is not. If 
each element is to be written uniquely, h must be reduced to h = ac. If G has a 
free set of generators, it is called a free group. 

Conversely, given a set X , we can construct a free group G whose free set of 
generators is X. Let us call each element of X a letter. The product 

w = x'^xy- ■ ■ ■ x]" (4.15) 

is called a word, where xj e X and ij e Z. If ij ^ 0 and xj ^ Xj+ \ the word is 
called a reduced word. It is always possible to reduce a word by finite steps. For 
example, 

a~ 2 b~ 2 b 3 a 4 b 3 c~ 2 c 4 — a~ 2 b°a 4 b 3 c 2 = a 2 b 3 c 2 . 



A word with no letters is called an empty word and denoted by 1 . For example, 
it is obtained by reducing w = a 0 . 

A product of words is defined by simply juxtaposing two words. Note that a 
juxtaposition of reduced words is not necessarily reduced but it is always possible 
to reduce it. For example, if v — a 2 c~ 2 b 2 and w — b~ 2 c 2 b , the product vw is 
reduced as 

vw = a 2 c -3 b 2 b~ 2 c 2 b 2 — a 2 c~ 3 c 2 b 2 = a 2 c~ l b 2 . 


Thus, the set of all reduced words form a well-defined free group called the free 
group generated by X, denoted by F[Z], The multiplication is the juxtaposition 
of two words followed by reduction, the unit element is the empty word and the 
inverse of 


is 


w 1 = x„ 



Exercise 4.3. Let X = {a}. Show that the free group generated by X is 
isomorphic to Z . 


In general, an arbitrary group G is specified by the generators and certain 
constraints that these must satisfy. If {xk} is the set of generators, the constraints 
are most commonly written as 

'■=«■■■*£ = 1 ( 4 - 16 ) 

and are called relations. For example, the cyclic group of order n generated by x 
(in multiplicative notation) satisfies a relation x n = 1 . 

More formally, let G be a group which is generated by X = {xk}. Any 
element g e G is written as g = xj 1 xi, 2 ■ ■ ■ x’„" , where we do not require that 
the expression be unique (G is not necessarily free). For example, we have 
x 1 — x" +1 in Z. Let E[X] be the free group generated by X. Then there is a 
natural homomorphism ip from F[X] onto G defined by 

*1 X 2 • • • X n x \ x 2 ■ ■ ■ x n e G - ( 4 - 17 ) 

Note that this is not an isomorphism since the LHS is not unique, (p is onto since X 
generates both F[X] and G. Although /•’[A] is not isomorphic to G, F[X ]/ ker (p 
is (see theorem 3.1), 

F[X\/keup = G. (4.18) 

In this sense, the set of generators X and ker <p completely determine the group 
G. [ker <p is a normal subgroup. Lemma 3.1 claims that ker <p is a subgroup 
of F[A]. Let r e ker<p, that is, r e ,F[A] and (p{r ) = 1. For any element 
x e F[Z], we have cp(x~ l rx ) = (p(x~ l )(p(r)(p(x) = (p{x)~ 1 (p(r)(p{x) = 1, 
hence x~ l rx e ker <p.] 



In this way, a group G generated by A is specified by the relations. The 
juxtaposition of generators and relations 

Ol ,...,x p \r\,...,r q ) (4.19) 

is called the presentation of G. For example, Z„ = (x; x") and Z = (x; 0). 

Example 4.4. Let Z © Z = {x n y m \n, m e Z} be a free Abelian group generated 
by X — {x, y }. Then we have xy — yx. Since xyx~ l y~ l — 1, we have a relation 
r — xyx _i y~ l . The presentation of Z © Z is (x, y : xyx~ 1 y~ 1 ). 

4.4.2 Calculating fundamental groups of polyhedra 

We shall be sketchy here to avoid getting into the technical details. We 
shall follow Armstrong (1983); the interested reader should consult this book 
or any textbook on algebraic topology. As noted in the previous chapter, a 
polyhedron | K\ is a nice approximation of a given topological space X within 
a homeomorphism. Since fundamental groups are topological invariants, we have 
tt\(X) = 7Ti(|Aj). We assume A isanarcwise connected space and drop the base 
point. Accordingly, if we have a systematic way of computing 7Ti (| /C |), we can 
also find n\(X). 

We first define the edge group of a simplicial complex, which corresponds to 
the fundamental group of a topological space, then introduce a convenient way of 
computing it. Let / : | K\ — > A be a triangulation of a topological space A. If we 
note that an element of the fundamental group of A can be represented by loops 
in A, we expect that similar loops must exist in | K | as well. Since any loop in | K \ 
is made up of 1-simplexes, we look at the set of all 1-simplexes in | K\, which can 
be endowed with a group structure called the edge group of K . 

An edge path in a simplicial complex A is a sequence vovi . . . Vk of vertices 
of |A|, in which the consecutive pair u,-u,-+ 1 is a 0- or 1-simplex of \ K\. [For 
technical reasons, we allow the possibility u,- = v,-+i, in which case the relevant 
simplex is a 0-simplex Vj = i;, + i .] If vq = Vk (—v), the edge path is called 
an edge loop at v. We classify these loops into equivalence classes according to 
some equivalence relation. We define two edge loops a and /3 to be equivalent 
if one is obtained from the other by repeating the following operations a finite 
number of times. 

(1) If the vertices u, v and w span a 2-simplex in K, the edge path uvw may 
be replaced by uw and vice versa ; see figure 4.10(a). 

(2) As a special case, if u = w in (1), the edge path uvw corresponds to 
traversing along u v first then reversing backwards from v to w = u. This edge 
path uvu may be replaced by a 0-simplex u and vice versa, see figure 4. 1 0(b). 

Let us denote the equivalence class of edge loops at v, to which W] . . . Vk-iv 
belongs, by {viq . . . Vk-\v}. The set of equivalence classes forms a group under 
the product operation defined by 

{vu | . . . Uk-lV } * {l>iq . . . l>;_i l>} = {l>Mi . . . Uk-\VV\ . . . l>;_i l>}. (4.20) 


u = w 


Figure 4.10. Possible deformations of the edge loops. In (a), uvw is replaced by uw. In 
( b ), uvu is replaced by u. 


The unit element is an equivalence class {i>} while the inverse of {vv\ . . . Vk-i v} 
is . . . v i v | . This group is called the edge group of K at v and denoted by 

E(K\ v). 

Theorem 4.7. E (K ; v) is isomorphic to n\ (| K | ; v). 

The proof is found in Armstrong (1983), for example. This isomorphism 
q> : E(K\ v) -> 7Ti ( | A' | ; v) is given by identifying an edge loop in K with a loop 
in \K \. To find E ( K ; v ) , we need to read off the generators and relations. Let L 
be a simplicial subcomplex of K, such that 

(a) L contains all the vertices (0-simplexes) of K ; 

(b) the polyhedron \ L\ is arcwise connected and simply connected. 

Given an arcwise-connected simplicial complex K, there always exists a 
subcomplex L that satisfies these conditions. A one-dimensional simplicial 
complex that is arcwise connected and simply connected is called a tree. A tree 
Tm is called the maximal tree of K if it is not a proper subset of other trees. 

Lemma 4.3. A maximal tree Tm contains all the vertices of K and hence satisfies 
conditions (a) and (b) above. 

Proof. Suppose Tm does not contain some vertex w. Since K is arcwise 
connected, there is a 1-simplex vw in K such that v e 7 m and w f Tm- Tm U 
{uu;} U {u>} is a one-dimensional subcomplex of K which is arcwise connected, 
simply connected and contains Tm- which contradicts the assumption. □ 

Suppose we have somehow obtained the subcomplex L. Since \L\ is simply 
connected, the edge loops in \L\ do not contribute to E( K : v). Thus, we can 
effectively ignore the simplexes in L in our calculations. Let i>o (—v), v \ .... , v n 
be the vertices of K. Assign an ‘object’ gij for each ordered pair of vertices u; , vj 
if ( VjVj ) is a 1-simplex of K. Let Gi K: L) be a group that is generated by all g,j. 
What about the relations? We have the following. 



(1) Since we ignore those simplexes in L, we assign gij = 1 if (vjVj) e L. 

(2) If (vi v j Vk) is a 2-simplex of K, there are no non-trivial loops around v,- vj Vk 
and we have the relation gijgjkgki = 1- 

The generators \g/j } and the set of relations completely determine the group 
G(K; L). 

Theorem 4.8. G(K\ L) is isomorphic to E(K\ v ) — ni{\K\\ v). 

In fact, we can be more efficient than is apparent. For example, ga should 
be set equal to 1 since ga corresponds to the vertex i>; which is an element of 
L. Moreover, from gijgji = ga = 1, we have gij — gT 1 . Therefore, we only 
need to introduce those generators gij for each pair of vertices Vj, vj such that 
( VjVj ) e K — L and i < j. Since there are no generators gij such that (i jvj) e L, 
we can ignore the first type of relation. If ( v/VjVk ) is a 2-simplex of K — L such 
that i < j < k , the corresponding relation is uniquely given by gijgjk — gik 
since we are only concerned with simplexes ( V{Vj ) such that i < j . 

To summarize, the rules of the game are as follows. 

(1) First, find a triangulation / : \K \ — > X. 

(2) Find the subcomplex L that is arcwise connected, simply connected and 
contains all the vertices of K. 

(3) Assign a generator gjj to each 1 -simplex ( VjVj ) of K — L, for which i < j . 

(4) Impose a relation gijgjk = gik if there is a 2-simplex (vtVjVk) such that 
i < j < k. If two of the vertices Vj , Vj and Vk form a 1 -simplex of L, the 
corresponding generator should be set equal to 1 . 

(5) Now n\(X) is isomorphic to G(K\ L) which is a group generated by {gij} 
with the relations obtained in (4). 

Let us work out several examples. 

Example 4.5. From our construction, it should be clear that E (K ; v) and G(K : L) 
involve only the 0-, 1- and 2-simplexes of K. Accordingly, if K ,2] denotes a 2- 
skeleton of K. which is defined to be the set of all 0-, 1- and 2-simplexes in K, 
we should have 

7ri(|K|) = 7ri(|^ (2) |). (4.21) 

This is quite useful in actual computations. For example, a 3-simplex and its 
boundary have the same 2-skeleton. A 3-simplex is a polyhedron | AT | of the solid 
ball Z) 3 , while its boundary \L\ is a polyhedron of the sphere S 2 . Since />’ is 
contractible, jti{\K\) = {e}. From (4.21) we find 7Ti(>S 2 ) = 7Ti (| AT | ) = {e}. In 
general, for n > 2, the in + 1 (-simplex cr„+i and the boundary of er„+i have the 
same 2-skeleton. If we note that cr„+i is contractible and the boundary of a n+ \ is 
a polyhedron of S n , we find the formula 


= {e} 


n > 2. 


(4.22) 



Figure 4.11. A triangulation of a 3-bouquet. The bold lines denote the maximal tree L. 


Example 4.6. Let K = {iq, iq, i> 3 , (tqtq), (iqu 3 ), (iqtq)} be a simplicial 
complex of a circle 5*. We take tq as the base point. A maximal tree may be 
L = { iq, iq, i> 3 , (iq v 2 ), (iq 1 ) 3 )}. There is only one generator £ 23 - Since there are 
no 2-simplexes in K , the relation is empty. Hence, 


TTi (S l ) = G(K ■ L) = (g 23 ; 0) = Z (4.23) 


in agreement with theorem 4.5. 

Example 4.7. An H-bouquet is defined by the one-point union of n circles. For 
example, figure 4.11 is a triangulation of a 3-bouquet. Take the common point 
v as the base point. The bold lines in figure 4.11 form a maximal tree L. The 
generators of G ( K : L) are g 12 , g 3 4 and g^. There are no relations and we find 

n\ (3-bouquet) = G(K\ L) — (x, y, z; 0). (4.24) 

Note that this is a free group but not free Abelian. The non-commutativity can 
be shown as follows. Consider loops a and P at v encircling different holes. 
Obviously the product a*p*a~ l cannot be continuously deformed into P, hence 
[a]*[P]*[a]~ l ^[p],or 


[a] * m / m * [«]• (4-25) 

In general, an n-bouquet has n generators gn, ■ ■ ■ , g2n-\ 2n and the 
fundamental group is isomorphic to the free group with n generators with no 
relations. 



0 1 a 2 x 0 



0 12 0 


Figure 4.12. A triangulation of the torus. 

Example 4.8. Let D 2 be a two-dimensional disc. A triangulation K of D 2 is given 
by a triangle with its interior included. Clearly K itself may be L and K — L is 
empty. Thus, we find n\(K) = {e}. 


Example 4.9. Figure 4.12 is a triangulation of the torus T 2 . The shaded area is 
chosen to be the subcomplex L. [Verify that it contains all the vertices and is both 
arcwise and simply connected.] There are 1 1 generators with ten relations. Let us 
take x = go 2 and y = go 4 and write down the relations 

(a) g02 £27 = goi -> goi — x 

X 1 

(b) g03 g37 = 807 g37 — x 

1 .V 

(c) g37 g 78 = g38 -> g38 = 

X 1 

(d) £34 £48 = £38 848 = X 

1 .V 

(e) £24 £48 = £28 £24-^ = £28 

X 

(f) £02 £24 = 


X 


8 04 

y 


xg 24 = y 



(g) 

§04 

§46 = 

§06 

§06 = y 


y 

1 



(h) 

§01 

§16 = 

§06 

§16 = y 


1 


y 


(i) 

§16 

§68 

§18 

II 

00 

t 


y 

1 



a) 

§12 

§28 = 

§18 

->• §28 = y 


1 


y 


(e) and (f) that x 1 yx — 

§28- 

We finally have 


§ 02 = g 07 = g 37 = §38 = §48 = * 
g04 = g 06 = g 16 = §18 = §28 = V 
§24 = x~ l y 

with a relation x ~ 1 yx = y or 

xyx~ l y~ l = 1. (4.26) 

This shows that G(K: L) is generated by two commutative generators (note 
xy = yx), hence (cf example 4.4) 

G(K\ L) = (x, y; xyx~ l y~ l ) = Z © Z (4.27) 

in agreement with (4.1 1). 

We have the following intuitive picture. Consider loops a = 0 — > 1 — »■ 
2 — 0 and = 0 — > 3 — > 4 — > 0. The loop a is identified with x = g 02 since 
§12 = §01 = 1 and p with y = §04. They generate tz\(T 2 ) since a and ft are 
independent non-trivial loops. In terms of these, the relation is written as 

a * p * a~ l * p~ x ~ c v (4.28) 

where c v is a constant loop at v, see figure 4.13. 

More generally, let be the torus with genus §. As we have shown in 
problem 2.1, is expressed as a subset of K 2 with proper identifications at 
the boundary. The fundamental group of lb, is generated by 2 § loops a; , Pi 
(1 < i < §). Similarly, to (4.28), we verify that 

g 

]~ [(«; * Pi * up 1 * P~ X ) ~ c v (4.29) 

i=i 

If we denote the generators corresponding to a, by x,- and p, by y; , there is only 
one relation among them, 

g 

ncwrV) = '■ 

1=1 


(4.30) 




Figure 4.14. A triangulation of the Klein bottle. 


Exercise 4.4. Figure 4. 14 is a triangulation of the Klein bottle. The shaded area is 
the subcomplex L. There are 1 1 generators and ten relations. Take x = go 2 and 
y = ,go 4 and write down the relations for 2-simplexes to show that 

7Ti (Klein bottle) = (x, y; xyxy -1 ). (4.31) 


Example 4.10. Figure 4.15 is a triangulation of the projective plane WP 2 . The 
shaded area is the subcomplex L. There are seven generators and six relations. 


3 



Figure 4.15. A triangulation of the projective plane. 


Let us take x — g 23 and write down the relations 


(a) 

£23 

£34 

= £24 -»• 

£24 = X 


X 

1 



(b) 

824 

£46 

= £26 

£26 = X 


X 

1 



(c) 

8 12 

£26 

= £16 

8 16 = X 


1 

X 



(d) 

£13 

£36 

= £16 -> 

£36 = * 


1 


X 


(e) 

£35 

£56 

— £36 

£35 = * 



1 

X 


(f) 

£23 

£35 

= 8 25 

X 2 = 1. 


X 

X 

1 



Hence, we find that 

tti(IRP 2 ) = (x; x 2 ) = Z 2 . (4.32) 

Intuitively, the appearance of a cyclic group is understood as follows. 
Figure 4.16(a) is a schematic picture of MP 2 . Take loops a and fi. It is easy 
to see that a is continuously deformed to a point, and hence is a trivial element of 
7 T 1 (IRP 2 ). Since diametrically opposite points are identified in ffiLP 2 , /3 is actually 



Figure 4.16. (a) a is a trivial loop while the loop ft cannot be shrunk to a point, (b) ft * ft 
is continuously shrunk to a point. 


a closed loop. Since it cannot be shrank to a point, it is a non-trivial element of 
jti (HP 2 ). What about the product? ft * ft is a loop which traverses from P to Q ~ 
P twice. It can be read off from figure 4.16 (b) that ft * ft is continuously shrunk 
to a point, and thus belongs to the trivial class. This shows that the generator x, 
corresponding to the homotopy class of the loop /3, satisfies the relation x 2 = 1, 
which verifies our result. 

The same pictures can be used to show that 

7ri(ffiP 3 ) = Z 2 (4.33) 

where RP 3 is identified as .S' 3 with diametrically opposite points identified, 
R P 3 = S 3 /(x ~ —x). If we take the hemisphere of S 3 as the representative, 
1RP 3 can be expressed as a solid ball D 3 with diametrically opposite points on the 
surface identified. If the discs D 2 in figure 4.16 are interpreted as solid balls D 3 , 
the same pictures verify (4.33). 

Exercise 4.5. A triangulation of the Mobius strip is given by figure 3.8. Find the 
maximal tree and show that 


7t \ (Mobius strip) = Z. (4.34) 

[Note: Of course the Mobius strip is of the same homotopy type as S 1 , hence 
(4.34) is trivial. The reader is asked to obtain this result through routine 
procedures.] 

4.4.3 Relations between Hi(K) and jti(|A |) 

The reader might have noticed that there is a certain similarity between the first 
homology group H\{K) and the fundamental group tT](\K\). For example, the 
fundamental groups of many spaces (circle, disc, n-spheres, torus and many more) 
are identical to the corresponding first homology group. In some cases, however, 
they are different: H\ (2-bouquet) = Z © Z and n\ (2-bouquet) = (x,y : 0), for 


example. Note that Hi (2-bouquet) is a free Abelian group while 7t\ (2-bouquet) 
is a free group. The following theorem relates 7ri(|K|) to H\(K). 

Theorem 4.9. Let K be a connected simplicial complex. Then I! \(K ) is 
isomorphic to 7Ti (| K\)/F, where F is the commutator subgroup (see later) of 

Let G be a group whose presentation is ( x,;r m ). The commutator 
subgroup F of G is a group generated by the elements of the form XiXjxT l xJ l . 
Thus, G/F is a group generated by [x, } with the set of relations { r m ) and 
{xiXjxT l x~ 1 }. The theorem states that if jri(|K|) = (x,- : r m ), then H[(K ) = 

(xj : r m ,XjXjxT l Xj). For example, from n\ (2-bouquet) = (x, y : 0), we find 

7ri(2-bouquet)/F = (x, y; xyx~ l y~ l ) = Z © Z 

which is isomorphic to Hi (2-bouquet). 

The proof of theorem 4.9 is found in Greenberg and Harper (1981) and also 
outlined in Croom (1978). 

Example 4.11. From ti\ (Klein bottle) = (x, y; xyxy~ l ), we have 

7Ti(Klein bottle)//*’ = (x, y; xvxy~ l , xyx~ y~ l ). 

Two relations are replaced by x 2 = 1 and xyx -1 v _1 = 1 to yield 

tt \ (Klein bottle)/F = (x, y; xyx _1 y _1 , x 2 ) = Z © Z 2 
= Hi (Klein bottle) 

where the factor Z is generated by y and Z 2 by x. 

Corollary 4.2. Let X be a connected topological space. Then tc\ (X ) is isomorphic 
to Hi ( X ) if and only if 7T\ (X) is commutative. In particular, if tc\ {X) is generated 
by one generator, tc\(X) is always isomorphic to H\(X). [Use theorem 4.9.] 

Corollary 4.3. If X and Y are of the same homotopy type, their first homology 
groups are identical: H\(X) — ll\(Y). [Use theorems 4.9 and 4.3.] 

4.5 Higher homotopy groups 

The fundamental group classifies the homotopy classes of loops in a topological 
space X. There are many ways to assign other groups to X. For example, we may 
classify homotopy classes of the spheres in X or homotopy classes of the tori in 
X. It turns out that the homotopy classes of the sphere S" (n > 2) form a group 
similar to the fundamental group. 



4.5.1 Definitions 


Let 7" (« > 1) denote the unit n-cube 7 x • • • x 7, 

7" ={(si,...,j„)|0<Si < 1(1 <i < n)}. (4.35) 

The boundary 3 7" is the geometrical boundary of I", 

3 7" ={(s l ,...,s n ) e 7"| some sj = 0orl}. (4.36) 

We recall that in the fundamental group, the boundary 3/ of I — [0, 1] is mapped 
to the base point xq. Similarly, we assume here that we shall be concerned with 
continuous maps a : I" — > X , which map the boundary 37" to a point xq e X. 
Since the boundary is mapped to a single point xo, we have effectively obtained 
.S'" from 7"; cf figure 2.8. If 7"/37" denotes the cube 7" whose boundary 37" is 
shrunk to a point, we have 7"/37" = 5". The map a is called an n-loop at xq. A 
straightforward generalization of definition 4.4 is as follows. 

Definition 4.10. Let X be a topological space and a, />:/"—> V be n-loops at 
xq e X. The map a is homotopic to ft, denoted by a ~ /J, if there exists a 
continuous map F : I' 1 x I -+ X such that 

F(si, ..., s n , 0) = a(si s n ) (4.37a) 

F(s\, ..., s n , 1) = ..., s n ) (4.37b) 

T^si , . . . , s n ,t) — x o for (si, ... , s n ) e 37", tel. (4.37c) 


F is called a homotopy between a and /). 


Exercise 4.6. Show that a ~ /3 is an equivalence relation. The equivalence class 
to which a belongs is called the homotopy class of a and is denoted by [a] . 


Let us define the group operations. The product a * p of n -loops a and f> is 
defined by 


a * P(s i, . . . , s„) 


I a(2si , . . . , s„) 0 < si < j 

fi(2si - 1 s„) \ < -si < 1 . 


(4.38) 


The product a * fi looks like figure 4.17(a) in X. It is helpful to express it as 
figure 4.17(h). If we define a -1 by 

a _1 (ii, . .. ,s n ) = a(l - si, . . . , s n ) (4.39) 


it satisfies 

a~ l *a(si, . . . , s n ) ~ a * aT 1 (,V| , . . . , s n ) ~ c T0 (^i, . . . , s n ) (4.40) 

where c xo is a constant n-loop at xo e X, c XQ : (s i, , s n ) i->- xq. Verify that 
both a * /3 and a -1 are n-loops at xq. 



Figure 4.17. A product a * /3 of /7-loops a and fi. 


Definition 4.11. Let X be a topological space. The set of homotopy classes of 
n-loops (« > 1) at xq e X is denoted by it„ (X, xq) and called the /ith homotopy 
group at .ico- 71 n (x, xo) is called the higher homotopy group if n > 2. 

The product a * just defined naturally induces a product of homotopy 
classes defined by 

[a] * [fi] = [cc * 0] (4.41) 

where a and are n-loops at xq. The following exercises verify that this product 
is well defined and satisfies the group axioms. 

Exercise 4.7. Show that the product of //-loops defined by (4.41) is independent 
of the representatives: cf lemma 4.1. 

Exercise 4.8. Show that the nth homotopy group is a group. To prove this, the 
following facts may be verified; cf theorem 4.1. 

(1) ([a]*L8])*[y] = [a]*([i8]*[y]). 

(2) [a] * [c x ] = [c x ] * [a] = [a], 

(3) [a] * [a -1 ] = [c A ], which defines the inverse [a] -1 = [a -1 ]. 

We have excluded rcfiX . xq) so far. Let us classify maps from 7° to X. We 
note 7° = {0} and 37° = 0. Let a, ft : {0} — > X be such that crfO) = x and 
yS (0) = y. We define a ~ jl if there exists a continuous map F : {0} x 7 -» X 
such that 7^(0, 0) — x and F(0, 1) = y. This shows that a ~ ft if and only if 
x and y are connected by a curve in X, namely they are in the same (arcwise) 
connected component. Clearly this equivalence relation is independent of xo and 
we simply denote the zeroth homology group by tto(X). Note, however, that 
jto(X) is not a group and denotes the number of (arcwise) connected components 
of X. 




Figure 4.18. Higher homotopy groups are always commutative, a * p ~ (5 * a. 


4.6 General properties of higher homotopy groups 

4.6.1 Abelian nature of higher homotopy groups 

Higher homotopy groups are always Abelian; for any n -loops a and f$ at xo e X, 
[a] and [/l] satisfy 

M *m = [/]*[«]■ (4.42) 

To verify this assertion let us observe figure 4.18. Clearly the deformation is 
homotopic at each step of the sequence. This shows that a * / ~ ft * a, namely 
[<*]*[£] = [/?]*[«]. 

4.6.2 Arcwise connectedness and higher homotopy groups 

If a topological space X is arcwise connected, 7t n (X . xo) is isomorphic to 
7T„(X, x\ ) for any pair xo, xi e X. The proof is parallel to that of theorem 4.2. 
Accordingly, if X is arcwise connected, the base point need not be specified. 

4.6.3 Homotopy invariance of higher homotopy groups 

Let X and Y be topological spaces of the same homotopy type; see definition 
4.6. If / : X -> Y is a homotopy equivalence, the homotopy group it„ ( X , xo) 
is isomorphic to tt„(Y, /(xo)); cf theorem 4.3. Topological invariance of higher 
homotopy groups is the direct consequence of this fact. In particular, if X is 
contractible, the homotopy groups are trivial: it„ (. X , xo) — {e},n > 1 . 

4.6.4 Higher homotopy groups of a product space 

Let X and Y be arcwise connected topological spaces. Then 

n„(X x Y) = JT„(X) © 7 r„(F) (4.43) 


cf theorem 4.6. 

4.6.5 Universal covering spaces and higher homotopy groups 

There are several cases in which the homotopy groups of one space are given by 
the known homotopy groups of the other space. There is a remarkable property 













between the higher homotopy groups of a topological space and its universal 
covering space. 

Definition 4.12. Let X and X be connected topological spaces. The pair (X, p), 
or simply X, is called the covering space of X if there exists a continuous map 
p : X -> X such that 

(1) p is surjective (onto) 

(2) for each x e X, there exists a connected open set U C X containing 
x, such that p~ l (U) is a disjoint union of open sets in X, each of which is 
mapped homeomorphically onto U by p. 

In particular, if X is simply connected, (X , p) is called the universal 
covering space of X. [Remarks'. Certain groups are known to be topological 
spaces. They are called topological groups. For example SO (n) and SU(«) are 
topological groups. If X and X in definition 4. 12 happen to be topological groups 
and p : X -* X to be a group homomorphism, the (universal) covering space is 
called the (universal) covering group.] 

For example, M is the universal covering space of S l , see section 4.3. Since 
S’ 1 is identified with U(l), K is a universal covering group of U(l) if IR is regarded 
as an additive group. The map p : M -> U(l) may be p : x — >• e l27TX . Clearly p 
is surjective and if U — [e l27TX \x € (xo — 0.1, xo + 0.1)}, then 

p~ l (U) = [J (xo - 0.1 + n , xq + 0.1 + n) 

ne Z 

which is a disjoint union of open sets of TO. It is easy to show that p is also a 
homomorphism with respect to addition in TO and multiplication in U(1 ). Hence, 
(IR, p) is the universal covering group of U(l) = S 1 . 

Theorem 4.10. Let (X.p) be the universal covering space of a connected 
topological space X. If xo e X and xo e X are base points such that p(x o) = xo, 
the induced homomorphism 

p* : n„ (X, x 0 ) tt„ (X, x 0 ) (4.44) 

is an isomorphism for n > 2. [Warning: This theorem cannot be applied if n — 1; 
7i i (TO) = [e] while ^(S 1 ) = Z.] 

The proof is given in Croom (1978). For example, we have 7Z„ (TO) = [e[ 
since IR is contractible. Then we find 

tt,i(S l ) = 7r„(U(l)) = [e] n > 2. (4.45) 

Example 4.12. Let S' 1 — {x e IR' !+1 1 \x\ 2 — 1}. The real projective space IRP" is 
obtained from S' 1 by identifying the pair of antipodal points (x, — x). It is easy to 



see that S' 1 is a covering space of IRP" for n > 2. Since tc\ (S' 1 ) — {e} for n > 2, 
S' 1 is the universal covering space of IRP" and we have 

n n (WP m ) = 7t n (S m ). (4.46) 

It is interesting to note that IRP 3 is identified with SO(3). To see this let 
us specify an element of SO(3) by a rotation about an axis n by an angle 0 
(0 <6 <7 r) and assign a ‘vector’ SI = On to this element. !T2 takes its value in 
the disc D 3 of radius tt. Moreover, mi and —jtn represent the same rotation and 
should be identified. Thus, the space to which Q belongs is a disc IP whose anti- 
podal points on the surface S 2 are identified. Note also that we may express IRP 3 
as the northern hemisphere D 3 of .S' 3 , whose anti-podal points on the boundary S 2 
are identified. This shows that HP 3 is identified with SO(3). 

It is also interesting to see that .S 3 is identified with SU(2). First note that 
any element g e SU(2) is written as 

* = (» t) w2+|i " 2 = 1 - (447) 

If we write a — u + iv and b = x + iy, this becomes 5 3 , 
u 2 + v 2 + x 2 + y 2 = 1. 


Collecting these results, we find 

7r„(SO(3)) = jr„(IRP 3 ) = 7T„(5 3 ) = jr„(SU(2)) n> 2. (4.48) 

More generally, the universal covering group Spin(n) of SO (n) is called the spin 
group. For small n, they are 


Spin(3) = SU(2) 

(4.49) 

Spin(4) = SU(2) x SU(2) 

(4.50) 

Spin(5) = USp(4) 

(4.51) 

Spin(6) = SU(4). 

(4.52) 


Here USp(2/V) stands for the compact group of 2 N x 2 N matrices A satisfying 
A l J A — J , where 


J = 


0 Iff \ 

-In 0 J ' 


4.7 Examples of higher homotopy groups 

In general, there are no algorithms to compute higher homotopy groups jt„(X). 
An ad hoc method is required for each topological space for n > 2. Here, we 
study several examples in which higher homotopy groups may be obtained by 
intuitive arguments. We also collect useful results in table 4.1. 


Table 4.1. Useful homotopy groups. 




jrt 

n 2 

7T 3 

7T4 

^5 

^6 

SO(3) 


z 2 

0 

z 

Z-2 

z 2 

Z\2 

SO(4) 



0 

z + z 

Z 2 + z 3 

Z 2 + Zt 

Z 12 + Z 12 

SO(5) 


Z 2 

0 

z 

Z 2 

z 2 

0 

SO(6) 


Z 2 

0 

z 

0 

z 

0 

SO (n) 

n > 6 

z 2 

0 

z 

0 

0 

0 

U(l) 


Z 

0 

0 

0 

0 

0 

SU(2) 


0 

0 

z 

Z 2 

Z 2 

Z 12 

SU(3) 


0 

0 

z 

0 

z 

z 6 

SU(n) 

n > 3 

0 

0 

z 

0 

z 

0 

S 2 


0 

z 

z 

Z 2 

Z 2 

Z 12 

s 3 


0 

0 

z 

Z 2 

z 2 

Z 12 

5 4 


0 

0 

0 

z 

z 2 

z 2 



0 

0 

z 

0 

0 

z 3 

f 4 


0 

0 

z 

0 

0 

0 

Ee 


0 

0 

z 

0 

0 

0 

El 


0 

0 

z 

0 

0 

0 

E& 


0 

0 

z 

0 

0 

0 


Example 4.13. If we note that n n (X,xo) is the set of the homotopy classes of 
n-loops S" in X, we immediately find that 

7T n (S n ,x 0 ) = Z n> 1 . ( 4 . 53 ) 

If a maps S' 1 onto a point xq e S" , [a] is the unit element 0 e Z. Since both 
I n /dl n and S" are orientable, we may assign orientations to them. If a maps 
I n /dl" homeomorphically to S" in the same sense of orientation, then [a] is 
assigned an element 1 e Z. If a homeomorphism a maps /"/97" onto S" in an 
orientation of opposite sense, [a] corresponds to an element —1. For example, 
let ti — 2. Since I 2 /dI 2 = S 2 , the point in I 2 can be expressed by the polar 
coordinate ( 0 , </>), see figure 4.19. Similarly, X = S 2 can be expressed by the 
polar coordinate (O' , <j>'). Let a : (0, <p ) —> (O ' , <//) be a 2-loop in X. If O' — 0 
and </>' = <p , the point (O', (j) f ) sweeps S 2 once while the point (0, cf>) scans / 2 
once in the same orientation. This 2-loop belongs to the class +1 e JC 2 (S 2 , xq). 
If a : (0, <p) — ^ (O ' , <p') is given by O' = 0 and <p' = 2(f>, the point (O ' , cp') 
sweeps S 2 twice while (0, <p) scans I 2 once. This 2-loop belongs to the class 
2 e n 2 (S 2 , xo). In general, the map (0,<p) m* (0, k(p), k e Z, corresponds to the 
class k of TCiiS 2 , xo). A similar argument verifies (4.53) for general n > 2. 

Example 4.14. Noting that S" is a universal covering space of M.P" for n > 2, we 
find 


7T„(MP' ? ) = jt„(S tt ) = z 


n > 2. 


(4.54) 



Figure 4.19. A point in I 2 may be expressed by polar coordinates (0, <p). 

[Of course this happens to be true for n = 1, since MP 1 = .S' 1 . ] For example, we 
have jt 2 (KLP 2 ) = 7T2(S 2 ) = Z. Since SU(2) = S 3 is the universal covering group 
of SO(3) = MP 3 , it follows from theorem 4.10 that (see also (4.48)) 

7r 3 (SO(3)) = tt 3 (SU(2)) = jr 3 (S 3 ) = Z. (4.55) 

Shankar’s monopoles in superfluid 3 He-A correspond to non-trivial elements 
of these homotopy classes, see section 4.10. 7r 3 (SU(2)) is also employed in the 
classification of instantons in example 9.8. 

In summary, we have table 4.1. In this table, other useful homotopy groups 
are also listed. We comment on several interesting facts. 

(a) Since Spin(4) = SU(2) x SU(2) is the universal covering group of SO(4), 
we have jr„(SO(4)) = 7r„(SU(2)) © 7r„(SU(2)) forrc > 2. 

(b) There exists a map J called the ./-homomorphism J : jtk(SO(n)) — »■ 
Ttk+n (S' 1 ), see Whitehead (1978). In particular, ii k — 1. the homomorphism 
is known to be an isomorphism and we have jti (SO(n)) = tt, i+ \ (S" ). For 
example, we find 

^i(SO(2)) = it 3 (S 2 ) = Z 

tti(SO(3)) = it 4 (S 3 ) = 7t 4 (SU(2)) = ^ 4 (SO(3)) = Z 2 . 

(c) The Bott periodicity theorem states that 


Jr*(U(n)) = w t (SU(n)) = 


\{e) 

[z 


if k is even 
if k is odd 


(4.56) 


for n > (A + 1 ) / 2. Similarly, 


7Tjt(0(n)) = jtic(SO(n)) = 


7Li2 

7L 


if A; = 2, 4, 5,6 (mod 8) 
if A = 0, 1 (mod 8) 
if A = 3 , 7 (mod 8) 


(4.50) 


for n > A + 2. Similar periodicity holds for symplectic groups which we 
shall not give here. 

Many more will be found in appendix A, table 6 of Ito (1987). 


4.8 Orders in condensed matter systems 

Recently topological methods have played increasingly important roles in 
condensed matter physics. For example, homotopy theory has been employed to 
classify possible forms of extended objects, such as solitons, vortices, monopoles 
and so on, in condensed systems. These classifications will be studied in 
sections 4.8-4.10. Here, we briefly look at the order parameters of condensed 
systems that undergo phase transitions. 

4.8.1 Order parameter 

Let H be a Hamiltonian describing a condensed matter system. We assume H is 
invariant under a certain symmetry operation. The ground state of the system need 
not preserve the symmetry of H . If this is the case, we say the system undergoes 
spontaneous symmetry breakdown. 

To illustrate this phenomenon, we consider the Heisenberg Hamiltonian 

H = -jJ2 s i ■ S J + h J2 Si (4.57) 

i 

which describes N ferromagnetic Heisenberg spins {5/}. The parameter J is a 
positive constant, the summation is over the pair of the nearest-neighbour sites 
(;, j ) and h is the uniform external magnetic field. The partition function is 
Z = tre~P H , where ft — 1/7’ is the inverse temperature. The free energy F 
is defined by exp(— ftF) — Z. The average magnetization per spin is 

1 1 dF 

m = — > {St) = (4.58) 

N ^ N/3 dh 

where (...) = tr(. . ,e~^ H )/Z. Let us consider the limit h — >• 0. Although FI 
is invariant under the SO(3) rotations of all Sj in this limit, it is well known that 
m does not vanish for large enough ft and the system does not observe the SO(3) 
symmetry. It is said that the system exhibits spontaneous magnetization and 
the maximum temperature, such that m ^ 0 is called the critical temperature. 



The vector m is the order parameter describing the phase transition between 
the ordered state (in ^ 0) and the disordered state (m = 0). The system is still 
symmetric under SO(2) rotations around the magnetization axis in. 

What is the mechanism underlying the phase transition? The free energy is 
F = ( H ) — TS, S being the entropy. At low temperature, the term TS in F 
may be negligible and the minimum of F is attained by minimizing ( H }, which 
is realized if all 5, align in the same direction. At high temperature, however, the 
entropy term dominates F and the minimum of F is attained by maximizing .S', 
which is realized if the directions of S, are totally random. 

If the system is at a uniform temperature, the magnitude \m\ is independent 
of the position and m is specified by its direction only. In the ground state, m 
itself is expected to be independent of position. It is convenient to introduce 
the polar coordinate (6, cp) to specify the direction of in. There is a one-to-one 
correspondence between m and a point on the sphere S 2 . Suppose m varies as a 
function of position: in — m (x). At each point x of the space, a point (0, <p) of 
S 2 is assigned and we have a map (0(x). <p(x)) from the space to S 2 . Besides 
the ground state (and excited states that are described by small oscillations 
(spin waves) around the ground state) the system may carry various excited 
states that cannot be obtained from the ground state by small perturbations. 
What kinds of excitation are possible depends on the dimension of the space 
and the order parameter. For example, if the space is two dimensional, the 
Heisenberg ferromagnet may admit an excitation called the Belavin-Polyakov 
monopole shown in figure 4.20 (Belavin and Polyakov 1975). Observe that in 
approaches a constant vector ( z in this case) so the energy does not diverge. This 
condition guarantees the stability of this excitation; it is impossible to deform this 
configuration into the uniform one with m far from the origin kept fixed. These 
kinds of excitation whose stability depends on topological arguments are called 
topological excitations. Note that the field m( x) defines a map in : S 2 —*■ S 2 
and, hence, are classified by the homotopy group TC2(S 2 ) = Z. 


4.8.2 Superfluid 4 He and superconductors 

In Bogoliubov’s theory, the order parameter of superfluid 4 He is the expectation 
value 

(0(*)> = W = Ao(x)e iaW (4.59) 

where tp(x) is the field operator. In the operator formalism, 

<p{x) ~ (creation operator) + (annihilation operator) 


from which we find the number of particles is not conserved if ^ (x) ^ 0. This 
is related to the spontaneous breakdown of the global gauge symmetry. The 


M 


a 


m 



Figure 4.20. A sketch of the Belavin-Polyakov monopole. The vector m approaches z as 
1*1 

Hamiltonian of 4 He is 



+ ^ / dxdy(p\y)(t>(y)V(\x - y\)(p\x)(t)(x). (4.60) 

Clearly H is invariant under the global gauge transformation 

<p(x) — > e lx </>(jt). (4.61) 

The order parameter, however, transforms as 

TOc) -* e ix T(jc) (4.62) 

and hence does not observe the symmetry of the Hamiltonian. The 
phenomenological free energy describing 4 He is made up of two contributions. 
The main contribution is the condensation energy 

3b = ^\V(x)\ 2 + ^(x)\ 4 (4.63a) 

where a ~ ao(T — T c ) changes sign at the critical temperature T ~ 4 K. 
Figure 4.21 sketches To for T > T c and T < T c . If T > T c , the minimum 
of To is attained at 'I'(x) = 0 while if T < T c at |'T| = Ao = [—(6 a//3)] 1 / 2 . 
If T (x) depends on x, we have an additional contribution called the gradient 
energy 


Tgrad s • V*(X) 


(4.63b) 



Figure 4 . 21 . The free energy has a minimum at |'F| =0 for T > T c and at |'T| = Aq for 
T < T c . 


K being a positive constant. If the spatial variation of T (x) is mild enough, we 
may assume Ao is constant (the London limit). 

In the BCS theory of superconductors, the order parameter is given by 
(Tsuneto 1982) 

^aP = Wa(x)fp{x)) (4.64) 

i j/ a (x) being the (non-relativistic) electron field operator of spin a — (f , 4). It 
should be noted, however, that (4.64) is not an irreducible representation of the 
spin algebra. To see this, we examine the behaviour of '■P„p under a spin rotation. 
Consider an infinitesimal spin rotation around an axis n by an angle 6, whose 
matrix representation is 

9 u 

R = h + i — « M rr M , 

ay, being the Pauli matrices. Since \j/ a transforms as ir a — > R a ^^p we have 


'P, 


a/3 


R/'Kcc'P'R/ = {R • * • R l ) a p 


4/ 


i-/i(<r4 / (T2 — 'P 02<J ) 


aft 


where we note that — — 020 ^ 02 - Suppose 4' a , | g oc i(cr 2 )afi- Then 4> does not 
change under this rotation, hence it represents the spin-singlet pairing. We write 

4V(*) = A(x)(kr 2 )a/S = Ao(x)e 1<p(x) (ia 2 ) a p. (4.65a) 


If, however, we take 


Vapix) = A^(x)i ■ (r 2 ) a p 


(4.65b) 



we have 


^ap -> [A'' + Se fivl n v A^](ia M • a 2 ) a p. 

This shows that A ^ is a vector in spin space, hence (4.65b) represents the spin- 
triplet pairing. 

The order parameter of a conventional superconductor is of the form (4.65a) 
and we restrict the analysis to this case for the moment. In (4.65a), A(x) assumes 
the same form as T (x) of superfluid 4 He and the free energy is again given by 
(4.63). This similarity is attributed to the Cooper pair. In the superfluid state, 
a macroscopic number of 4 He atoms occupy the ground state (Bose-Einstein 
condensation) which then behaves like a huge molecule due to the quantum 
coherence. In this state creating elementary excitations requires a finite amount 
of energy and the flow cannot decay unless this critical energy is supplied. Since 
an electron is a fermion there is, at first sight, no Bose-Einstein condensation. 
The key observation is the Cooper pair. By the exchange of phonons, a pair of 
electrons feels an attractive force that barely overcomes the Coulomb repulsion. 
This tiny attractive force makes it possible for electrons to form a pair (in 
momentum space) that obeys Bose statistics. The pairs then condense to form 
the superfluid state of the Cooper pairs of electric charge 2e. 

An electromagnetic field couples to the system through the minimal coupling 

^grad = \ K |0„ - i2M„) A(JC)| 2 . (4.66) 

(The term 2e is used since the Cooper pair carries charge 2e.) Superconductors 
are roughly divided into two types according to their behaviour in applied 
magnetic fields. The type-I superconductor forms an intermediate state in which 
normal and superconducting regions coexist in strong magnetic fields. The 
type-II superconductor forms a vortex lattice (Abrikosov lattice) to confine the 
magnetic fields within the cores of the vortices with other regions remaining in 
the superconducting state. A similar vortex lattice has been observed in rotating 
superfluid 4 He in a cylinder. 

4.8.3 General consideration 

In the next two sections, we study applications of homotopy groups to the 
classification of defects in ordered media. The analysis of this section is based 
on Toulouse and Kleman (1976), Mermin (1979) and Mineev (1980). 

As we saw in the previous subsections, when a condensed matter system 
undergoes a phase transition, the symmetry of the system is reduced and this 
reduction is described by the order parameter. For definiteness, let us consider the 
three-dimensional medium of a superconductor. The order parameter takes the 
ioxm\!/(x) — AoOc)e ll?,( A. Let us consider a homogeneous system under uniform 
external conditions (temperature, pressure etc). The amplitude Ao is uniquely 
fixed by minimizing the condensation free energy. Note that there are still a large 
number of degrees of freedom left, i/r may take any value in the circle S l = U(l) 




Figure 4.22. A circle S 1 surrounding a line defect (vortex) is mapped to U(l) = S 1 . This 
map is classified by the fundamental group n\ (U(l). 


determined by the phase e" , ' i . In this way, a uniform system takes its value in 
a certain region M called the order parameter space. For a superconductor, 
M = 1/(1 ) . For the Heisenberg spin system, M — S 2 . The nematic liquid crystal 
has M — RP 2 while M = .S' 2 x SO(3) for the superfluid 3 He-A, see sections 4.9- 
4.10. 

If the system is in an inhomogeneous state, the gradient free energy cannot be 
negligible and i jr may not be in M. If the characteristic size of the variation of the 
order parameter is much larger than the coherence length, however, we may still 
assume that the order parameter takes its value in M, where the value is a function 
of position this time. If this is the case, there may be points, lines or surfaces in the 
medium on which the order parameter is not uniquely defined. They are called the 
defects. We have point defects (monopoles), line defects (vortices) and surface 
defects (domain walls) according to their dimensionalities. These defects are 
classified by the homotopy groups. 

To be more mathematical, let A be a space which is filled with the medium 
under consideration. The order parameter is a classical field x/s (x), which is also 
regarded as a map xfr : X — > M. Suppose there is a defect in the medium. For 
concreteness, we consider a line defect in the three-dimensional medium of a 
superconductor. Imagine a circle S l which encircles the line defect. If each part 
of S 1 is far from the line defect, much further than the coherence length % , we 
may assume the order parameter along S 1 takes its value in the order parameter 
space M = U(1 ), see figure 4.22. This is how the fundamental group comes into 
the problem; we talk of loops in a topological space U(l). The map S l — > U(l) 
is classified by the homotopy classes. Take a point ro e S l and require that ro be 
mapped to xq e M. By noting that 7Ti(U(l), xo) = Z, we may assign an integer 
to the line defect. This integer is called the winding number since it counts how 
many times the image of .S' 1 winds the space U(l). If two line defects have the 



same winding number, one can be continuously deformed to the other. If two 
line defects A and B merge together, the new line defect belongs to the homotopy 
class of the product of the homotopy classes to which A and B belonged before 
coalescence. Since the group operation in Z is an addition, the new winding 
number is a sum of the old winding numbers. A uniform distribution of the order 
parameter corresponds to the constant map i fr(x) = xq e M, which belongs to 
the unit element 0 e Z . If two line defects of opposite winding numbers merge 
together, the new line defect can be continuously deformed into the defect-free 
configuration. 

What about the other homotopy groups? We first consider the dimensionality 
of the defect and the sphere S' ' which surrounds it. For example, consider a point 
defect in a three-dimensional medium. It can be surrounded by S 2 and the defect 
is classified by Tt 2 {M, xq). If M has many components, 7Tq(M) is non-trivial. Let 
us consider a three-dimensional Ising model for which M — { |} U {f }. Then 
there is a domain wall on which the order parameter is not defined. For example, 
if S — f for x < 0 and S — for x > 0, there is a domain wall in the yz-plane 
at x = 0. In general, an m -dimensional defect in a c/-dimensiona] medium is 
classified by the homotopy group 7t n (M, xq) where 


n = d — m — 1 . 


(4.67) 


In the case of the Ising model, d = 3, m = 2; hence n = 0. 

4.9 Defects in nematic liquid crystals 

4.9.1 Order parameter of nematic liquid crystals 

Certain organic crystals exhibit quite interesting optical properties when they are 
in their fluid phases. They are called liquid crystals and they are characterized 
by their optical anisotropy. Here we are interested in so-called nematic liquid 
crystals. An example of this is octyloxy-cyanobiphenyl whose molecular structure 
is 



The molecule of a nematic liquid crystal is very much like a rod and the order 
parameter, called the director, is given by the average direction of the rod. Even 
though the molecule itself has a head and a tail, the director has an inversion 
symmetry; it does not make sense to distinguish the directors n = —*■ and —n = 
<r-. We are tempted to assign a point on .S' 2 to specify the director. This works 
except for one point. Two antipodal points n — (9, (p) and — n — (jc — 0, n + <p) 
represent the same state; see figure 4.23. Accordingly, the order parameter of the 
nematic liquid crystal is the projective plane HP 2 . The director field in general 



Figure 4.23. Since the director n has no head or tail, one cannot distinguish n from — n. 
Therefore, these two pictures correspond to the same order-parameter configuration. 



Figure 4.24. A vortex in a nematic liquid crystal, which corresponds to the non-trivial 
element of (R P 2 ) = %2- 

depends on the position r. Then we may define a map / : Iff’ — > ffi P 2 . This 
map is called the texture. The actual order-parameter configuration in iff’ is also 
called the texture. 

4.9.2 Line defects in nematic liquid crystals 

From example 4.10 we have jri(RP 2 ) = Z 2 = {0, 1}. There exist two kinds 
of line defect in nematic liquid crystals; one can be continuously deformed into 
a uniform configuration while the other cannot. The latter represents a stable 
vortex, whose texture is sketched in figure 4.24. The reader should observe how 
the loop a is mapped to RP 2 by this texture. 

Exercise 4.9. Show that the line ’defect’ in figure 4.25 is fictitious, namely the 
singularity at the centre may be eliminated by a continuous deformation of 
directors with directors at the boundary fixed. This corresponds to the operation 
1 + 1 = 0 . 


Figure 4.25. A line defect which may be continuously deformed into a uniform 
configuration. 



Figure 4.26. The texture of a point defect in a nematic liquid crystal. 


4.9.3 Point defects in nematic liquid crystals 

From example 4.14, we have t^IKP 2 ) = Z. Accordingly, there are stable point 
defects in the nematic liquid crystal. Figure 4.26 shows the texture of the point 
defects that belong to the class 1 e Z. 

It is interesting to point out that a line defect and a point defect may be 
combined into a ring defect, which is specified by both 7Ti(IRP 2 ) and ^(MF 2 ), 
see Mineev (1980). If the ring defect is observed from far away, it looks like 



Figure 4.27. The texture of a ring defect in a nematic liquid crystal. The loop a classifies 
7ri(!LP 2 ) while the sphere (2-loop) fi classifies ^(RP 2 ). 


a point defect, while its local structure along the ring is specified by 7ri(IRP 2 ). 
Figure 4.27 is an example of such a ring defect. The loop a classifies jt\ (IRP 2 ) = 
hi while the sphere (2-loop) ft classifies 7r2(ffi.P 2 ) = Z. 


4.9.4 Higher dimensional texture 

The third homotopy group 7r(IRP 2 ) = Z leads to an interesting singularity- 
free texture in a three-dimensional medium of nematic liquid crystal. Suppose 
the director field approaches an asymptotic configuration, say n = (1,0, 0) 1 , 
as |r| — > oo. Then the medium is effectively compactified into the three- 
dimensional sphere S 3 and the topological structure of the texture is classified 
by 7i3 (IRP 2 ) = Z. What is the texture corresponding to a non-trivial element of 
the homotopy group? 

An arbitrary rotation in K 3 is specified by a unit vector e, around which the 
rotation is carried out, and the rotation angle a. It is possible to assign a ‘vector’ 
£2 = ae to this rotation. It is not exactly a vector since £2 = ire and — £2 = —jte 
are the same rotation and hence should be identified. Therefore, £2 belongs to the 
real projective space IRP 3 . Suppose we take hq = (1,0, 0) 1 as a standard director. 
Then an arbitrary director configuration is specified by rotating n o around some 
axis e by an angle a: n = R(e, a)n o, where R(e. a) is the corresponding rotation 
matrix in SO(3). Suppose a texture field is given by applying the rotation 


ae(r) = f(r)r 


(4.68) 




(a) ( b ) 


Figure 4.28. The texture of the non-trivial element of t^ORP 2 ) = Z. (a) shows the 
rotation ‘vector' ae. The lengths approaches n as |r| -*■ oo. ( b ) shows the corresponding 
director field. 


to no, where r is the unit vector in the direction of the position vector r and 

f 0 r = 0 

m = 

7T r -» oo. 

Figure 4.28 shows the director field of this texture. Note that although there 
is no singularity in the texture, it is impossible to ‘wind off’ this to a uniform 
configuration. 


4.10 Textures in superfluid 3 He-A 
4.10.1 Superfluid 3 He-A 

Here comes the last and most interesting example. Before 1972 the only example 
of the BCS superfluid was the conventional superconductor (apart from indirect 
observations of superfluid neutrons in neutron stars). Figure 4.29 is the phase 
diagram of superfluid 3 He without an external magnetic field. From NMR and 
other observations, it turns out that the superfluid is in the spin-triplet p-wave 
state. Instead of the field operators (see (4.65b)), we define the order parameter 
in terms of the creation and annihilation operators. The most general form of the 
triplet superfluid order parameter is 


3 

( c ct,k c fi,—k ) (\^20’ii)apdf l (k) 

1 


(4.69a) 


Solid 



Figure 4.29. The phase diagram of superfluid 3 He. 


where a and ft are spin indices. The Cooper pair forms in the p-wave state hence 
d^(k) is proportional to Y\ m ~ hi, 


3 

dn(k) = J2 A o A Lu k i. (4.69b) 

i=\ 

The bulk energy has several minima. The absolute minimum depends on the 
pressure and the temperature. We are particularly interested in the A phase in 
figure 4.29. 

The A-phase order parameter takes the form 

A fli =d„(A l +i\ 2 )i (4.70) 

where d is a unit vector along which the spin projection of the Cooper pair 
vanishes and ( A i , A 2 ) is a pair of orthonormal unit vectors. The vector d takes 
its value in S 2 . If we define / = A] x Aj. the triad ( A 1 . At,/) forms an 
orthonormal frame at each point of the medium. Since any orthonormal frame 
can be obtained from a standard orthonormal frame (e 1 , ej, £' 3 ) by an application 
of a three-dimensional rotation matrix, we conclude that the order parameter of 
3 He-A is S 2 x SO(3). The vector / introduced here is the axis of the angular 
momentum of the Cooper pair. 

For simplicity, we neglect the variation of the d- vector. [In fact, d is locked 



along / due to the dipole force.] The order parameter assumes the form 

A t = A 0 (A! + A 2 )j (4.71) 

where A i , A 2 and / = A i x A 2 form an orthonormal frame at each point of 
the medium. Let us take a standard orthonormal frame (ci, e 2 , e 2 ). The frame 
(A[ , A 2,1) is obtained by applying an element g e SO(3) to the standard frame, 

g : (ei,e2,e 2 ) ->• (Ai, A 2 ,Z). (4.72) 

Since g depends on the coordinate x, the configuration ( A i (x ) , A 2 (x), / (x)) 
defines a map \[r : X -> SO(3) asr i-> g (x ) . The map i fr is called the texture 
of a superfluid "’He. 1 The relevant homotopy groups for classifying defects in 
superfluid 3 He- A are ir n (SO (3)). 

If a container is filled with 3 He-A, the boundary poses certain conditions on 
the texture. The vector / is understood as the direction of the angular momentum 
of the Cooper pair. The pair should rotate in the plane parallel to the boundary 
wall, thus / should be perpendicular to the wall. [Remark: If the wall is diffuse, 
the orbital motion of Cooper pairs is disturbed and there is a depression in the 
amplitude of the order parameter in the vicinity of the wall. We assume, for 
simplicity, that the wall is specularly smooth so that Cooper pairs may execute 
orbital motion with no disturbance.] There are several kinds of free energy and 
the texture is determined by solving the Euler-Lagrange equation derived from 
the total free energy under given boundary conditions. 

Reviews on superfluid 3 He are found in Anderson and Brinkman (1975), 
Leggett (1975) and Mermin (1978). 

4.10.2 Line defects and non-singular vortices in 3 He-A 

The fundamental group of SO(3) = HP 3 is 7ri(IRP 3 ) = Z 2 = {0, 1}. 
Textures which belong to class 0 can be continuously deformed into the uniform 
configuration. Configurations in class 1 are called disgyrations and have 
been analysed by Maki and Tsuneto (1977) and Buchholtz and Fetter (1977). 
Figure 4.30 describes these disgyrations in their lowest free energy configurations. 

A remarkable property of Z 2 is the addition 1 + 1=0; the coalescence of 
two disgyrations produces a trivial texture. By merging two disgyrations, we may 
construct a texture that looks like a vortex of double vorticity (homotopy class 
‘2’) without a singular core; see figure 4.31(a). It is easy to verify that the image 
of the loop a traverses HP 3 twice while that of the smaller loop ft may be shrunk 
to a point. This texture is called the Anderson-Toulouse vortex (Anderson and 
Toulouse 1977). Mermin and Ho (1976) pointed out that if the medium is in a 
cylinder, the boundary imposes the condition / _L (boundary) and the vortex is 
cut at the surface, see figure 4.31(i>) (the Mermin-Ho vortex). 

1 The name ‘texture’ is, in fact, borrowed from the order-parameter configuration in liquid crystals, 
see section 4.9. 


(a) (b) ( c ) 



Figure 4.30. Disgyrations in 3 He- A. 


(a) ( b ) 



Figure 4.31. The Anderson-Toulouse vortex (a) and the Mermin-Ho vortex ( b ). In (b) the 
boundary forces / to be perpendicular to the wall. 

Since 7 T 2 O&P 3 ) = {e}, there are no point defects in 3 He-A. However, 
713 (MP 3 ) = Z introduces a new type of pointlike structure called the Shankar 
monopole, which we will study next. 

4.10.3 Shankar monopole in 3 He-A 

Shankar (1977) pointed out that there exists a pointlike singularity-free object 
in 3 He-A. Consider an infinite medium of 3 He-A. We assume the medium is 
asymptotically uniform, that is, (Ai, A2,/) approaches a standard orthonormal 
frame (ei , c' 2 , £' 3 ) as |v -> 00 . Since all the points far from the origin are mapped 
to a single point, we have compactified M 3 to .S' 3 . Then the texture is classified 
according to 7 T 3 (KP 3 ) = Z. Let us specify an element of SO(3) by a ‘vector’ 
S2 = On in IRP 3 as before (example 4.12). Shankar (1977) proposed a texture, 

fi(r) = - • f{r) 
r 


(4.73) 




Figure 4.32. The Shankar monopole: (a) shows the ‘vectors' ft(r) and (b) shows the triad 
( A i , A 2 , /). Note that as |r| — > 00 the triad approaches the same configuration. 


where f(r) is a monotonically decreasing function such that 


fir) = 



r = 0 
r = 00 . 


(4.74) 


We formally extend the radius of I RP 3 to 2 it and define the rotation angle modulo 
2i r. This texture is called the Shankar monopole, see figure 4.32(a). At first sight 
it appears that there is a singularity at the origin. Note, however, that the length 
of ft is 2 7r there and it is equivalent to the unit element of SO(3). Figure 4.32(h) 
describes the triad field. Since ft(r) = 0 as r 00 , irrespective of the direction, 
the space M 3 is compactified to S 3 . As we scan the whole space, ft(r) sweeps 
SO(3) twice and this texture corresponds to class 1 of 773 ( S O ( 3 ) ) = Z. 


Exercise 4.10. Sketch the Shankar monopole which belongs to the class —1 of 
773 (KLP 3 ). [You cannot simply reverse the arrows in figure 4.32.] 

Exercise 4.11. Consider classical Heisenberg spins defined in IR 2 , see section4.8. 
Suppose spins take the asymptotic value 


n(x) -» e z \x\ > L 


(4.75) 


for the total energy to be finite, see figure 4.20. Show that the extended objects in 
this system are classified by 7Z2(S 2 ). Sketch examples of spin configurations for 
the classes — 1 and +2. 


Problems 

4.1 Show that the n-sphere S n is a deformation retract of punctured Euclidean 
space R n+l — {0}. Find a retraction. 


4.2 Let D 2 be the two-dimensional closed disc and .S' 1 = 3 D 2 be its boundary. 
Let / : D 2 -* D 2 be a smooth map. Suppose / has no fixed points, namely 
f(p) ^ p for any p e D 2 . Consider a semi-line starting at p through f(p) (this 
semi-line is always well defined if p ^ f(p))- The line crosses the boundary at 
some point q e S l . Then define / : D 2 — »■ S' 1 by f(p) — q. Use Tries' 1 ) = Z 
and jc\{D 2 ) = {0} to show that such an / does not exist and hence, that / must 
have fixed points. [Hint: Show that if such an / existed, D 2 and .S' 1 would be of 
the same homotopy type.] This is the two-dimensional version of the Brouwer 
fixed-point theorem. 

4.3 Construct a map / : S 3 -> S 2 which belongs to the elements 0 and 1 of 
TtT,(S 2 ) = Z. See also example 9.9. 
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MANIFOLDS 


Manifolds are generalizations of our familiar ideas about curves and surfaces to 
arbitrary dimensional objects. A curve in three-dimensional Euclidean space is 
parametrized locally by a single number t as (x(t), y(t), z(t)), while two numbers 
u and v parametrize a surface as ( x(u , v), y(u, v), z(u, v)). A curve and a surface 
are considered locally homeomorphic to ffi and IR' 2 , respectively. A manifold, 
in general, is a topological space which is homeomorphic to ffi'" locally, it may 
be different from ffi'" globally. The local homeomorphism enables us to give 
each point in a manifold a set of m numbers called the (local) coordinate. If a 
manifold is not homeomorphic to ffi'" globally, we have to introduce several local 
coordinates. Then it is possible that a single point has two or more coordinates. 
We require that the transition from one coordinate to the other be smooth. As 
we will see later, this enables us to develop the usual calculus on a manifold. 
Just as topology is based on continuity, so the theory of manifolds is based on 
smoothness. 

Useful references on this subject are Crampin and Pirani (1986), Matsushima 
(1972), Schutz (1980) and Warner (1983). Chapter 2 and appendices B and C of 
Wald (1984) are also recommended. Flanders (1963) is a beautiful introduction 
to differential forms. Sattinger and Weaver (1986) deals with Lie groups and Lie 
algebras and contains many applications to problems in physics. 

5.1 Manifolds 

5.1.1 Heuristic introduction 

To clarify these points, consider the usual sphere of unit radius in ffi 3 . We 
parametrize the surface of S 2 , among other possibilities, by two coordinate 
systems — polar coordinates and stereographic coordinates. Polar coordinates 6 
and <p are usually defined by (figure 5.1) 


x = sin 6 cos 4> y = sin 6 sin 4> z = cos 9 , 


(5.1) 


where </; runs from 0 to 2 it and 6 from 0 to n . They may be inverted on the sphere 
to yield 




(5.2) 


z x 



Stereographic coordinates, however, are defined by the projection from the North 
Pole onto the equatorial plane as in figure 5.1. First, join the North Pole (0, 0, 1) 
to the point P(.x,y,z ) on the sphere and then continue in a straight line to 
the equatorial plane z = 0 to intersect at Q(X . Y, 0). Then X and Y are the 
stereographic coordinates of P. We find 



The two coordinate systems are related as 

X — cot IjQ cos cp Y = cot j6 sin cp. (5.4) 

Of course, other systems, polar coordinates with different polar axes or 
projections from different points on S 2 , could be used. The coordinates on the 
sphere may be kept arbitrary until some specific calculation is to be carried out. 
[The longitude is historically measured from Greenwich. However, there is no 
reason why it cannot be measured from New York or Kyoto.] This arbitrariness 
of the coordinate choice underlies the theory of manifolds: all coordinate systems 
are equally good. It is also in harmony with the basic principle of physics: a 
physical system behaves in the same way whatever coordinates we use to describe 
it. 



Another point which can be seen from this example is that no coordinate 
system may be usable everywhere at once. Let us look at the polar coordinates 
on S 2 . Take the equator (6 = jit ) for definiteness. If we let </> range from 0 to 
2jt, then it changes continuously as we go round the equator until we get all the 
way to </> = 2jt. There the (/(-coordinate has a discontinuity from 2jt to 0 and 
nearby points have quite different (/(-values. Alternatively we could continue 4> 
through 2tc . Then we will encounter another difficulty: at each point we must 
have infinitely many (/(-values, differing from one another by an integral multiple 
of 27r. A further difficulty arises at the poles, where (f> is not determined at all. 
[An explorer on the Pole is in a state of timelessness since time is defined by the 
longitude.] Stereographic coordinates also have difficulties at the North Pole or 
at any projection point that is not projected to a point on the equatorial plane; and 
nearby points close to the Pole have widely different stereographic coordinates. 

Thus, we cannot label the points on the sphere with a single coordinate 
system so that both of the following conditions are satisfied. 

(i) Nearby points always have nearby coordinates. 

(ii) Every point has unique coordinates. 

Note, however, that there are infinitely many ways to introduce coordinates that 
satisfy these requirements on a part of S 2 . We may take advantage of this fact to 
define coordinates on S 2 : introduce two or more overlapping coordinate systems, 
each covering a part of the sphere whose points are to be labelled so that the 
following conditions hold. 

(i') Nearby points have nearby coordinates in at least one coordinate system, 
(ii') Every point has unique coordinates in each system that contains it. 

For example, we may introduce two stereographic coordinates on S 2 , one a 
projection from the North Pole, the other from the South Pole. Are these 
conditions (i') and (ii') enough to develop sensible theories of the manifold? In 
fact, we need an extra condition on the coordinate systems. 

(iii) If two coordinate systems overlap, they are related to each other in a 
sufficiently smooth way. 

Without this condition, a differentiable function in one coordinate system 
may not be differentiable in the other system. 

5.1.2 Definitions 

Definition 5.1. M is an m -dimensional differentiable manifold if 

(i) M is a topological space; 

(ii) M is provided with a family of pairs {({//, </?/)}; 

(iii) {[/;} is a family of open sets which covers M, that is, U = M. (p, is a 
homeomorphism from t/; onto an open subset U\ of W" (figure 5.2); and 



Figure 5.2. A homeomorphism i pj maps Uj onto an open subset U' C R m , providing 
coordinates to a point p e Uj . If Uj fl Uj ^ 0, the transition from one coordinate system 
to another is smooth. 


(iv) given Uj and Uj such that Uj fl Uj ^ 0, the map i j/jj = <pj o q> j 1 from 

(Pj(Uj fl Uj) to (pi(Uj fl Uj) is infinitely differentiable. 

The pair ( Uj,(pj ) is called a chart while the whole family {(£/,-, q>j)} is 
called, for obvious reasons, an atlas. The subset Uj is called the coordinate 
neighbourhood while (pi is the coordinate function or, simply, the coordinate. 
The homeomorphism (pj is represented by m functions {x Up ), . . . , x m ip)}. The 
set {x^ip)} is also called the coordinate. A point p e M exists independently of 
its coordinates; it is up to us how we assign coordinates to a point. We sometimes 
employ the rather sloppy notation x to denote a point whose coordinates are 
{x 1 , . . . , x"'}, unless several coordinate systems are in use. From (ii) and (iii), M 
is locally Euclidean. In each coordinate neighbourhood Uj , M looks like an open 
subset of W m whose element is {x 1 , . . . , x" 1 }. Note that we do not require that M 
be W m globally. We are living on the earth whose surface is S 2 , which does not 
look like M 2 globally. However, it looks like an open subset of M 2 locally. Who 
can tell that we live on the sphere by just looking at a map of London, which, of 
course, looks like a part of IR 2 7 1 

1 Strictly speaking the distance between two longitudes in the northern part of the city is slightly 



If Uj and Uj overlap, two coordinate systems are assigned to a point in 
Ui fl Uj. Axiom (iv) asserts that the transition from one coordinate system to 
another be smooth ( C °° ). The map cp, assigns m coordinate values x 1 ' (1 < /x < 
m ) to a point p e U, fl Uj, while <pj assigns y v (1 < v < m) to the same 
point and the transition from y to x, x 1 ' = x 1 ' (y), is given by m functions of m 
variables. The coordinate transformation functions x ,L = x ll (y) are the explicit 
form of the map ip-ji = <p/ o (p~ l . Thus, the differentiability has been defined 
in the usual sense of calculus: the coordinate transformation is differentiable if 
each function x jl (y) is differentiable with respect to each y v . We may restrict 
ourselves to the differentiability up to A'th order (C k ). However, this does not 
bring about any interesting conclusions. We simply require, instead, that the 
coordinate transformations be infinitely differentiable, that is, of class C°°. Now 
coordinates have been assigned to M in such a way that if we move over M in 
whatever fashion, the coordinates we use vary in a smooth manner. 

If the union of two atlases {(Ui, (pi)} and {(V /, i fr/)} is again an atlas, these 
two atlases are said to be compatible. The compatibility is an equivalence 
relation, the equivalence class of which is called the differentiable structure. It is 
also said that mutually compatible atlases define the same differentiable structure 
on M. 

Before we give examples, we briefly comment on manifolds with 
boundaries. So far, we have assumed that the coordinate neighbourhood [/, is 
homeomorphic to an open set of JR"' . In some applications, however, this turns 
out to be too restrictive and we need to relax this condition. If a topological space 
M is covered by a family of open sets { Uj } each of which is homeomorphic to an 
open set of H m = {(x 1 , . . . , x m ) e IR m \x m > 0}, M is said to be a manifold with 
a boundary, see figure 5.3. The set of points which are mapped to points with 
x m = 0 is called the boundary of M, denoted by 3 M. The coordinates of 3 M 
may be given by m — 1 numbers (x 1 , . . . , x m , 0). Now we have to be careful 
when we define the smoothness. The map : <Pj(Ui fl Uj) —*■ cpi(Ui fl Uj) 
is defined on an open set of H m in general, and i j/jj is said to be smooth if it is 
C°° in an open set of ffi m which contains <pj ( U / fl Uj). Readers are encouraged to 
use their imagination since our definition is in harmony with our intuitive notions 
about boundaries. For example, the boundary of the solid ball Z) 3 is the sphere S 2 
and the boundary of the sphere is an empty set. 

5.1.3 Examples 

We now give several examples to develop our ideas about manifolds. They are 
also of great relevance to physics. 

Example 5.1. The Euclidean space ffi m is the most trivial example, where a single 
chart covers the whole space and (p may be the identity map. 


shorter than that in the southern part and one may suspect that one lives on a curved surface. Of 
course, it is the other way around if one lives in a city in the southern hemisphere. 



Figure 5.3. A manifold with a boundary. The point p is on the boundary. 

Example 5.2. Let m — 1 and require that M be connected. There are only two 
manifolds possible: a real line ffi and the circle .S' 1 . Let us work out an atlas of S 1 . 
For concreteness take the circle x 2 + y l = 1 in the jcy-plane. We need at least 
two charts. We may take them as in figure 5.4. Define <p ^ 1 : (0, 2tc) — > .S' 1 by 

: 6 M- (cos0, sin0) (5.5a) 

whose image is S 1 — {(1, 0)}. Define also : {—it, it) — »■ 5 1 by 

i : 0 i — > (cos 9, sin0) (5.5b) 

whose image is S l — {(—1, 0)}. Clearly <p and are invertible and all the 
maps ip \ , <p 2 , *p\ 1 and <p^ 1 are continuous. Thus, q>\ and q >2 are homeomorphisms. 
Verify that the maps t/q 2 — <p\ o <p^ 1 and 1/^21 = (pi 0 1 are smooth. 

Example 5.3. The n-dimensional sphere S n is a differentiable manifold. It is 
realized in M" +1 as 

n 

£(*') 2 = 1. (5.6) 

i=0 

Let us introduce the coordinate neighbourhoods 

U i+ = {(x°,x\ ...,x n )e S n \x‘ > 0} 

Ut- = {U°, x\ ..., x n ) e S n \x‘ < 0}. 


(5.7a) 

(5.7b) 


0 6 n 
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Figure 5.4. Two charts of a circle S 1 . 


Define the coordinate map <pt+ : t/,-+ —*■ E" by 

(Pi + (x °, . . . , x n ) — (x°, . . . , x' _1 , x' +1 , . . . , x n ) (5.8a) 

and (pi _ : [/;_ — > E" by 

<Pi-(x°, ..., x n ) = (x°, . . . , x ,_1 . x /+1 , . . . , x n ). (5.8b) 

Note that the domains of (pt+ and cpi- are different. cpj± are the projections of the 
hemispheres t/;± to the plane x' = 0. The transition functions are easily obtained 
from (5.8). Take S 2 as an example. The coordinate neighbourhoods are U x ±, U y ± 
and U z ±. The transition function ir y - x + = (p y - ° (p x + is given by 

ify-x+ : (y, z) -y 2 -z 2 , z'j (5.9) 

which is infinitely differentiable on U x+ fl t/ v _ . 

Exercise 5.1. At the beginning of this chapter, we introduced the stereographic 
coordinates on S 2 . We may equally define the stereographic coordinates projected 
from points other than the North Pole. For example, the stereographic coordinates 
([/, V ) of a point in S 2 — {South Pole} projected from the South Pole and ( X , Y) 
for a point in S 2 — {North Pole} projected from the North Pole are shown in figure 
5.5. Show that the transition functions between ({/, V) and (X. Y) are C°° and 
that they define a differentiable structure on M. See also example 8.1. 

Example 5.4. The real projective space E P" is the set of lines through the origin 
in E" +1 . If x = (x , . . . , x n ) ^ 0, x defines a line through the origin. Note 
that v e E" +1 defines the same line as x if there exists a real number a ^ 0 
such that y = ax. Introduce an equivalence relation ~ by x ~ y if there 



Figure 5.5. Two stereographic coordinate systems on S 2 . The point P may be projected 
from the North Pole N giving (X, Y ) or from the South Pole S giving (U, V). 


exists a e ffi — {0} such that y — ax. Then MP" = (K' !+1 — {0})/ ~. The 
n + 1 numbers x , x 1 , . . . , x n are called the homogeneous coordinates. The 
homogeneous coordinates cannot be a good coordinate system, since KP" is an 
n-dimensional manifold (an (n + l)-dimensional space with a one-dimensional 
degree of freedom killed). The charts are defined as follows. First we take the 
coordinate neighbourhood U, as the set of lines with x l ^ 0, and then introduce 
the inhomogeneous coordinates on £/; by 

Sl ) =xi/x i . (5.10) 

The inhomogeneous coordinates 

iw-C %> 

with = 1 omitted, are well defined on t/,- since x l ^ 0, and furthermore 
they are independent of the choice of the representative of the equivalence class 
since x J /x l — y J /y‘ if y = ax. The inhomogeneous coordinate £(,) gives the 
coordinate map cpi : U, —> ffi” , that is 

^ : (x°, . . . , x n ) i-> (x 0 /x‘, ..., x^/x 1 , x^/x 1 , ..., x n lx l ) 

where x' lx' = 1 is omitted. For x — (x°, x 1 , . . . , x") e Uj fl Uj we assign 
two inhomogeneous coordinates, = x k /x l and = x k /xK The coordinate 



(5.11) 


transformation i jrij — (p, o (p. 1 is 

This is a multiplication by x J /x ' . 

In example 4.12, we defined IK P n as the sphere S" with antipodal points 
identified. This picture is in conformity with the definition here. As a 
representative of the equivalence class [x], we may take points |x| = 1 on a line 
through the origin. These are points on the unit sphere. Since there are two points 
on the intersection of a line with S' ' we have to take one of them consistently, 
that is nearby lines are represented by nearby points in S n . This amounts to 
taking the hemisphere. Note, however, that the antipodal points on the boundary 
(the equator of S n ) are identified by definition, (x , . . . , x") ~ — (x , . . . , x"). 
This ‘hemisphere’ is homeomorphic to the ball D" with antipodal points on the 
boundary S"~ l identified. 

Example 5.5. A straightforward generalization of RP" is the Grassmann 
manifold. An element of IK P" is a one-dimensional subspace in IR " +1 . The 
Grassmann manifold Gk, n (IK) is the set of /.'-dimensional planes in IK" . Note that 
Gi : „+i (IK) is nothing but RP n . The manifold structure of Gk.n (IK) is defined in a 
manner similar to that of MP n . 

Let (IK) be the set of k x n matrices of rank/' (k < ri). Take A = 
( ajj ) e M\, n (IK) and define k vectors a, (1 < i < k ) in IK" by a, = iflij). Since 
rank A — k. k vectors a, are linearly independent and span a ^-dimensional plane 
in IK" . Note, however, that there are infinitely many matrices in Mk, n (IK) that yield 
the same A:-plane. Take g e GL (k, M) and consider a matrix A — gA e Mk,„ (IK). 
A defines the same /.-plane as A, since g simply rotates the basis within the k- 
plane. Introduce an equivalence relation ~ by A ~ A if there exists g e GLIA:, IK) 
such that A = gA. We identify G^.n (IK) with the coset space Mk. n (TK)/GL(Ac, IK). 

Let us find the charts of G^.n (IK). Take A e Mk, n (IK) and let { A i , . . . , A/}, 
/ = be the collection of all k x k minors of A. Since rank A = k , there exists 
some A a (1 < a < l) such that det A ^ 0. For example, let us assume the minor 
A i made of the first k columns has non-vanishing determinant, 

A = (A,,A!) (5.12) 

where Ai is a k x ( n — k) matrix. Let us take the representative of the class to 
which A belongs to be 

AY 1 ■ A = (I k , A- 1 ■ Ai) (5.13) 

where 4- is the k x k unit matrix. Note that A’j" 1 always exists since det Ai / 0. 
Thus, the real degrees of freedom are given by the entries of the k x (n — k) 
matrix A^ 1 • A i. We denote this subset of Gk.n(R) by JJ\. U\ is a coordinate 
neighbourhood whose coordinates are given by k(n — k) entries of A] -1 • A\. 
Since U\ is homeomorphic to M k{n ~ k ' 1 we find that 


dim Gk.n (IK) = k(n — k). 


(5.14) 



In the case where det A a ^ 0, where A a is composed of the columns 
(i i, h, . . . , ik), we multiply A" 1 to obtain the representative 

column i\ z '2 ... ik 


/ .. 

. 1 . 

. 0 ... 

... 0 

... \ 



. 0 . 

. 1 ... 

... 0 


(5.15) 

V 

. 0 . 

. 0 ... 

... 1 

... ) 



where the entries not written explicitly form a k x (n — k ) matrix. We denote this 
subset of Mk, n (K) with det A a ^ 0 by U a . The entries of the k x (n — k) matrix 
are the coordinates of U a . 

The relation between the projective space and the Grassmann manifold is 
evident. An element of Mi „+i(IR) is a vector A = (x°, x 1 , . . . , x n ). Since the 
ath minor A a of A is a number x a , the condition det A a ^ 0 becomes x a ^ 0. 
The representative (5.15) is just the inhomogeneous coordinate 

(. x a r\x 0 ,x l ,...,x a ,...,x n ) 

= (x°/x 0l ,x 1 /x 01 , ..., x a /x a = 1, . . . , x n /x a ). 

Let M be an m -dimensional manifold with an atlas {((/,, (pi)) and N be an n- 
dimensional manifold with {(V), i// ; ) ) . A product manifold M x N is an (m+n)- 
dimensional manifold whose atlas is {(U, x V j), ((pi, V''/)}- A point in M x N 
is written as ( p , q), p e M,q e N, and the coordinate function (<pi, (J/j) acts on 
(p, q) to yield (c pi(p ), i /fj(p)) e E'" + ". The reader should verify that a product 
manifold indeed satisfies the axioms of definition 5.1. 

Example 5.6. The torus T 2 is a product manifold of two circles, T 2 = S 1 x S 1 . If 
we denote the polar angle of each circle as 0, mod 2 n (i = 1,2), the coordinates 
of T 2 are (0 \ , 02 ). Since each S 1 is embedded in M 2 , T 2 may be embedded in M 4 . 
We often imagine T 2 as the surface of a doughnut in M 3 , in which case, however, 
we inevitably have to introduce bending of the surface. This is an extrinsic feature 
brought about by the ‘embedding’. When we say ‘a torus is a flat manifold’, we 
refer to the flat surface embedded in ffi 4 . See definition 5.3 for further details. 

We may also consider a direct product of n circles, 

T n = S l x S l x x S\. 

~~ - ^ 
n 

Clearly T" is an « -dimensional manifold with the coordinates (6i, 02, ■ ■ • ,6n) 
mod27r. This may be regarded as an u-cube whose opposite faces are identified, 
see figure 2.4 for n = 2. 

5.2 The calculus on manifolds 

The significance of differentiable manifolds resides in the fact that we may use 
the usual calculus developed in M" . Smoothness of the coordinate transformations 



ensures that the calculus is independent of the coordinates chosen. 

5.2.1 Differentiable maps 

Let / : M — > N be a map from an m -dimensional manifold M to an n- 
dimensional manifold N. A point p e M is mapped to a point /( p) e N, namely 
f : p i-> f(p), see figure 5.6. Take a chart (U, (p) on M and (V, \[r) on N, where 
p e U and fip) e V. Then / has the following coordinate presentation: 

fofocp- 1 : IIP -> ffi". (5.16) 

If we write (pip) — {x /l ) and 4r(f(p)) — {y 01 }, \jf o f o <p~ x is just the usual 
vector-valued function y = (js o / o <p -1 (x) of m variables. We sometimes use 
(in fact, abuse!) the notation y = fix) or y a = f a (x ,x ), when we know which 
coordinate systems on M and N are in use. \f y — \jr o f o (p~ l ix), or simply 
y a — f a (x 11 ), is C°° with respect to each x ! \ f is said to be differentiable at 
p or at x = (pip). Differentiable maps are also said to be smooth. Note that 
we require infinite (C°°) differentiability, in harmony with the smoothness of the 
transition functions i /r,-y . 

The differentiability of / is independent of the coordinate system. Consider 
two overlapping charts (t/i, q>\) and (U 2 , (pi)- Take a point p e U\ fl U 2 , whose 
coordinates by (p\ are {.Vj 4 }, while those by (P 2 are {x^}- When expressed in 
terms of {xf 4 }, / takes the form tfr o f o (pf l , while in {x^}, ° / ° (p = 



\[r o / o <p^ x ((p\ o cpfi 1 ). By definition, 1//12 = tp\ o tp^ is C°° . In the simpler 
expressions, they correspond to y — f (x \ ) and y = f(x\ (xj)). It is clear that 
if f(x 1 ) is C°° with respect to x ^ and xi(x 2 ) is C°° with respect to x%, then 
y — f(x i(x 2 )) is also C°° with respect to x%. 

Exercise 5.2. Show that the differentiability of / is also independent of the chart 
in N. 

Definition 5.2. Let f : M —> N be a homeomorphism and fi and <p be coordinate 
functions as previously defined. If \[r o / o (p~ l is invertible (that is, there exists a 
map cp o /' - o and both y = fo/o cp~ l (x ) and x = <p o / -1 o fi~ 1 (y) 
are C°°, / is called a diffeomorphism and M is said to be diffeomorphic to N 
and vice versa, denoted by M = N. 

Clearly dimM = dim N if M = N. In chapter 2, we noted that 
homeomorphisms classify spaces according to whether it is possible to deform 
one space into another continuously. Diffeomorphisms classify spaces into 
equivalence classes according to whether it is possible to deform one space to 
another smoothly. Two diffeomorphic spaces are regarded as the same manifold. 
Clearly a diffeomorphism is a homeomorphism. What about the converse? Is 
a homeomorphism a diffeomorphism? In the previous section, we defined the 
differentiable structure as an equivalence class of atlases. Is it possible for a 
topological space to carry many differentiable structures? It is rather difficult 
to give examples of ‘diffeomorphically inequivalent homeomorphisms’ since it is 
known that this is possible only in higher-dimensional spaces (dim M > 4). It 
was believed before 1956 that a topological space admits only one differentiable 
structure. However, Milnor (1956) pointed out that S 1 admits 28 differentiable 
structures. A recent striking discovery in mathematics is that M 4 admits an infinite 
number of differentiable structures. Interested readers should consult Donaldson 
(1983) and Freed and Uhlenbeck (1984). Here we assume that a manifold admits 
a unique differentiable structure, for simplicity. 

The set of diffeomorphisms f \ M —> M is a group denoted by Diff(M). 
Take a point p in a chart ((/, cp) such that cp(p) = x^(p). Under / e Diff(M), 
p is mapped to f(p ) whose coordinates are (p(f{p )) = y^ifip)) (we have 
assumed fip) e U). Clearly y is a differentiable function of x; this is an active 
point of view to the coordinate transformation. However, if (U, cp) and (V, xjr) are 
overlapping charts, we have two coordinate values x lL = (pip) andy M = fiip) for 
a point p e U (T V. The map jc 1 — ^ y is differentiable by the assumed smoothness 
of the manifold; this reparametrization is a passive point of view to the coordinate 
transformation. We also denote the group of reparametrizations by Diff(M). 

Now we look at special classes of mappings, namely curves and functions. 
An open curve in an m -dimensional manifold M is a map c : (a, h) — > M where 
(a, b) is an open interval such that a < 0 < b. We assume that the curve does 
not intersect with itself (figure 5.7). The number a ( b ) may be —00 (+ 00 ) and 
we have included 0 in the interval for later convenience. If a curve is closed, it is 



Figure 5.7. A curve c in M and its coordinate presentation cp o c. 

regarded as a map c : S 1 — »• M. In both cases, c is locally a map from an open 
interval to M. On a chart (U, (p), a curve cit ) has the coordinate presentation 
t = pc:l^P. 

A function / on M is a smooth map from M to M, see figure 5.8. On a chart 
(U, cp), the coordinate presentation of / is given by / o (p~ { : IR m -> IK which is 
a real- valued function of m variables. We denote the set of smooth functions on 
Mby 9{M). 

5.2.2 Vectors 

Now that we have defined maps on a manifold, we are ready to define other 
geometrical objects: vectors, dual vectors and tensors. In general, an elementary 
picture of a vector as an arrow connecting a point and the origin does not work in 
a manifold. [Where is the origin? What is a straight arrow? How do we define a 
straight arrow that connects London and Los Angeles on the surface of the Earth?] 
On a manifold, a vector is defined to be a tangent vector to a curve in M. 

To begin with, let us look at a tangent line to a curve in the ry -plane. If the 
curve is differentiable, we may approximate the curve in the vicinity of .ro by 

y - y(*o) = a(x - xo) (5.17) 

where a = Ay/Ax\ x=Xo . The tangent vectors on a manifold M generalize this 
tangent line. To define a tangent vector we need a curve c : (a, b) —*■ M and 
a function / : M — > K, where (a, b) is an open interval containing t = 0, see 
figure 5.9. We define the tangent vector at c(0) as a directional derivative of a 
function f(c(t)) along the curve c(t) at t = 0. The rate of change of f(c(t)) at 


R 



Figure 5.8. A function / : M — y R and its coordinate presentation / o tp 


1 — 0 along the curve is 

d f(c(t)) 
dt t = 

In terms of the local coordinate, this becomes 

3/ dx IJ (c(t)) 
dx^ dr 


(5.18) 


(5.19) 


[Note the abuse of the notation! The derivative df/dx 11 really means 3 (/ o 
(p~ l {x))/ dx 11 .] In other words, d/(c(r))/df at t = 0 is obtained by applying 
the differential operator X to /, where 


X = X^ 




dx ll (c(t)) 

dr 



(5.20) 


that is, 


d/(c(r)) 



= X[fl 


(5.21) 


Here the last equality defines X[f\. It is X — X 1 ' 3/3 .H' which we now define as 
the tangent vector to M at p — c(0) along the direction given by the curve c(r). 


Example 5.7. If X is applied to the coordinate functions q>(c(t)) — x^(t), we 
have 


X[x^] 



d x^(t) 
dr 


t = o 



b 


R 



Figure 5.9. A curve c and a function / define a tangent vector along the curve in terms of 
the directional derivative. 


which is the /ith component of the velocity vector if t is understood as time. 


To be more mathematical, we introduce an equivalence class of curves in M. 
If two curves ci(f) and cjlt) satisfy 


(i) ci(0) = c 2 (0) = p 


(ii) 


djc^(ci(0) 


d t 


dxl l (C2(t)) 


t = 0 


d t 


t = o 


ci (f ) and c '2 ( t ) yield the same differential operator X at p, in which case we define 
ci (?) ~ ci (f ) . Clearly ~ is an equivalence relation and defines the equivalence 
classes. We identify the tangent vector X with the equivalence class of curves 


[c«] = 


c(t) 


c(0) = c( 0) and 


dx M (c(f)) 


d t 


1=0 


d.r /t (c(f)) 

dr 



(5.22) 


rather than a curve itself. 

All the equivalence classes of curves at p e M, namely all the tangent 
vectors at p, form a vector space called the tangent space of M at p, denoted 
by T p M. To analyse T p M, we may use the theory of vector spaces developed in 
section 2.2. Evidently, e p = d/dx ^ (1 < p. < m) are the basis vectors of T p M, 
see (5.20), and dim T p M — dim M. The basis \e p } is called the coordinate 
basis. If a vector V e T p M is written as V = V IL e p , the numbers V 1 ' are called 
the components of V with respect to e p . By construction, it is obvious that a 
vector X exists without specifying the coordinate, see (5.21). The assignment of 


the coordinate is simply for our convenience. This coordinate independence of 
a vector enables us to find the transformation property of the components of the 
vector. Let p e Uj C I Uj and x — (pt ( p), y = <pj ( p). We have two expressions for 
X e T p M , 


X = X' 1 = X' 1 

dxf- 


3 

3 yf- 


This shows that X 1 ' and X 1 ' are related as 


~ 3y^ 

X^ = x v —. (5.23) 

dx v 

Note again that the components of the vector transform in such a way that the 
vector itself is left invariant. 

The basis of T p M need not be \e p }, and we may think of the linear 
combinations e,- = A/^e^, where A = (A,-^) e GL(m, IK). The basis {e,} is 
known as the non-coordinate basis. 


5.2.3 One-forms 


Since T p M is a vector space, there exists a dual vector space to T p M , whose 
element is a linear function from T p M to IK, see section 2.2. The dual space is 
called the cotangent space at p, denoted by T*M. An element co : T p M — >■ IK of 
T*M is called a dual vector, cotangent vector or, in the context of differential 
forms, a one-form. The simplest example of a one-form is the differential d / of 
a function / e T(M). The action of a vector V on / is V[f] — V^df /3x^ e IK. 
Then the action of d/ e T*M on V e T p M is defined by 

(d /, V) = V[f] (5.24) 

Clearly (d /, V) is E-linear in both V and /. 

Noting that d / is expressed in terms of the coordinate x — cp(p ) as 
df — (3//3x /J )d.r /i , it is natural to regard {d.r /i } as a basis of T* M . Moreover, 
this is a dual basis, since 



dx v 

dxt 1 


s;. 


An arbitrary one-form w is written as 


co — to n dx' 1 


(5.25) 


(5.26) 


where the co p are the components of co. Take a vector V — V^d/dx^ and a one- 

P "+ ” ■ p 

3 


■-’ll 

form co — co p dx 1 ' . The inner product ( , ) : T*M x T n M — > IK is defined 

by 


{co, V) = COaV V [dx^, 


dx 1 




(5.27) 



Note that the inner product is defined between a vector and a dual vector and not 
between two vectors or two dual vectors. 

Since to is defined without reference to any coordinate system, for a point 
p e Uj fl Uj , we have 

co — &>, t d.r M = w v dy v 

where x — tpi(p) and y = <pj(p). From dy v = (dy v /dx^dbc^ we find that 

• (5.28) 


5.2.4 Tensors 

A tensor of type (q. r ) is a multilinear object which maps q elements of T*M and 
r elements of T p M to areal number. ‘T p P (M) denotes the set of type ( q , r) tensors 
at p e M. An element of T q rp (M) is written in terms of the bases described earlier 
as 

T = a h;- ■ ■ 3^ 4t "’ - 4 ‘" r - (5 - 29) 

Clearly this is a linear function from 

® q T*M® r T p M 

to ffi. Let Vi = V^d/dx 11 (1 < i < r) and &>,- = a>i p dx ^ (1 < i < q). The action 
of T on them yields a number 

T(to i, i..,a> q \V x,..., V r ) = T^-^ vl ... Vr wi pl . ..co qPq V* 1 . . . V?' . 

In the present notation, the inner product is (&>, X) = co(X). 


5.2.5 Tensor fields 

If a vector is assigned smoothly to each point of M, it is called a vector field 
over M. In other words, V is a vector field if V[f] e fT(M) for any / e fF(M). 
Clearly each component of a vector field is a smooth function from M to ffi. The 
set of the vector fields on M is denoted as X(M). A vector field A at p e M 
is denoted by X\ p , which is an element of T p M. Similarly, we define a tensor 
field of type (q. r) by a smooth assignment of an element of 'Yj. p (M) at each 
point p e M. The set of the tensor fields of type (q , r) on M is denoted by 
T r(M). For example, is the set of the dual vector fields, which is also 

denoted by in the context of differential forms, see section 5.4. Similarly, 

Tq(M) = £F(M) is denoted by in the same context. 




Figure 5.10. A map / : M — > N induces the differential map /* : TpM -*■ T f(p)N . 


5.2.6 Induced maps 

A smooth map f : M N naturally induces a map /* called the differential 
map (figure 5.10), 

/* : T p M -> T np) N. (5.30) 

The explicit form of /* is obtained by the definition of a tangent vector as a 
directional derivative along a curve. If g e T(/V), then go/e S'(M). A vector 
V e 7/,M acts on go/ to give a number V[go /]. Now we define /*V e Tf( p )N 
by 

(/.V)[|] = V[?o/] (5.31) 

or, in terms of charts (C/, <p) on M and (V.i/r) on N, 

( f*V)[g o xlr~ l (y )] = f[go/o (5.32) 

where x = <p(p) and y — Let V — V^d/dx^ and /* V = W a d/dy a . 

Then (5.32) yields 

w a ^-[g o = v^^-[g o f o ^-'(jc)]. 

dy a dx v 

If we take g — y a , we obtain the relation between W a and V ,L , 

0 

= V x —y a {x). (5.33) 

dx v 

Note that the matrix (3y“/3x /i ) is nothing but the Jacobian of the map / : 
M — > N . The differential map /* is naturally extended to tensors of type (q, 0), 

Example 5.8. Let (x\x 2 ) and (y 1 ,y 2 ,y 3 ) be the coordinates in M and N, 

respectively, and let V — ad/dx 1 + bd/dx 2 be a tangent vector at (x,x 2 ). 



Let / : M — > N be a map whose coordinate presentation is y 
(x 1 ,x 2 , y/l — (x 1 ) 2 — (x 2 ) 2 ). Then 


/*y = v ' 4 


dy a 9 
dx 11 dy a 


= a 


_a_ _9_ 

dy 1 + dy 2 



9 

9y 3 


Exercise 5.3. Let f : M —*■ N and g : N -> P . Show that the differential map of 
the composite map g o f : M — >• P is 


(g o /)* = g* o /*. (5.34) 

A map f : M N also induces a map 

/* : T* Hp) N -* P;M. (5.35) 

Note that /* goes in the same direction as f, while /* goes backward, hence 
the name pullback, see section 2.2. If we take V e T p M and a» e T* (p) N, the 
pullback of co by f* is defined by 

</*©,V> = <©,/* V). (5.36) 

The pullback /* naturally extends to tensors of type (0 . r), f* : T ii r f( p> (N) -> 

T^ p (M). The component expression of f* is given by the Jacobian matrix 
(9y"/9.r /i ), see exercise 5.4. 

Exercise 5.4. Let f : M N be a smooth map. Show that for co = co a dy a e 
Tf( p )N, the induced one-form f*co = d.r M e T*M has components 

9y“ 

A =o) a — . (5.37) 

Exercise 5.5. Let / and g be as in exercise 5.3. Show that the pullback of the 
composite map g o f is 

(g°f)* = f*og*. (5.38) 


There is no natural extension of the induced map for a tensor of mixed type. 
The extension is only possible if / : M —*■ N is a diffeomorphism, where the 
Jacobian of f~ l is also defined. 


Exercise 5.6. Let 

T\^-®&x v 

dx v 

be a tensor field of type (1, 1) on M and let / : M -» N be a diffeomorphism. 
Show that the induced tensor on N is 


/* 






where x 1 ' and y a are local coordinates in M and N, respectively. 




Figure 5.11. (a) An immersion / which is not an embedding. ( b ) An embedding g and 
the submanifold gCS 1 ). 


5.2.7 Submanifolds 

Before we close this section, we define a submanifold of a manifold. The meaning 
of embedding is also clarified here. 

Definition 5.3. (Immersion, submanifold, embedding) Let / : M — >• N be a 
smooth map and let dim M < dim N. 

(a) The map / is called an immersion of M into N if /* : T p M — »■ T f {p) N 
is an injection (one to one), that is rank /* = dim M. 

(b) The map / is called an embedding if / is an injection and an immersion. 
The image f(M) is called a submanifold of N. [In practice, / ( M ) thus 
defined is diffeomorphic to M.] 

If / is an immersion, f* maps T p M isomorphically to an m -dimensional 
vector subspace of Tf( P )N since rank/* = dimM. From theorem 2.1, we also 
find ker /* = {0}. If / is an embedding, M is diffeomorphic to f(M). Examples 
will clarify these rather technical points. Consider a map / : 5 1 — > K 2 in figure 
5 . 1 1 (a ) . It is an immersion since a one-dimensional tangent space of S 1 is mapped 
by /* to a subspace of T / (p)R 2 . The image f(S l ) is not a submanifold of K 2 since 
/ is not an injection. The map g : 5 1 -> K 2 in figure 5.11 (Z?) is an embedding and 
g(5\) is a submanifold of M 2 . Clearly, an embedding is an immersion although 
the converse is not necessarily true. In the previous section, we occasionally 
mentioned the embedding of S n into K' ,+1 . Now this meaning is clear; if S ” is 
embedded by / : S n -> K" +1 then S n is diffeomorphic to f(S n ). 


5.3 Flows and Lie derivatives 


Let X be a vector field in M. An integral curve x(t) of A is a curve in M. whose 
tangent vector at x(t) is X\ x . Given a chart (U . <p), this means 


— = X ll (x(t)) 
at 


(5.39) 



where x^(r) is the /xth component of (p(x{t)) and X — X^d/dx^. Note the abuse 
of the notation: x is used to denote a point in M as well as its coordinates. [For 
later convenience we assume the point jc ( 0) is included in U .} Put in another 
way, finding the integral curve of a vector field X is equivalent to solving the 
autonomous system of ordinary differential equations (ODEs) (5.39). The initial 
condition Xq — x^(0) corresponds to the coordinates of an integral curve at t = 0. 
The existence and uniqueness theorem of ODEs guarantees that there is a unique 
solution to (5.39), at least locally, with the initial data Xg. It may happen that 
the integral curve is defined only on a subset of M, in which case we have to 
pay attention so that the parameter t does not exceed the given interval. In the 
following we assume that t is maximally extended. It is known that if M is a 
compact manifold, the integral curve exists for all f el 

Let a(t, xo) be an integral curve of X which passes a point xo at t — 0 and 
denote the coordinate by a^(t, xo)- Equation (5.39) then becomes 

^(t, x 0 ) = X»(a(t, x 0 )) (5.40a) 

dr 

with the initial condition 

<7^(0, x 0 ) = x'\ (5.40b) 

The map a : M x M — > M is called a flow generated by X e X(M). A flow 
satisfies the rule 

(7 (r, (s, Xo)) — (T (t T 5, Xo) (5.41) 

for any s,/el such that both sides of (5.41) make sense. This can be seen from 
the uniqueness of ODEs. In fact, we note that 

-^-cr ,l (r, a M Cs, xo)) = X^(a{f, a^(s, x 0 ))) 
dr 

er(0, cr(j, xo)) = <j(s, xo) 


and 


-^-cr ,l (f + s,x 0 ) = - — - — -o ,/i (r + 5 , x 0 ) = X^{a{t + s,xo)) 
dr d(r + s) 

o (0 + s, Xo) = cs(s, Xo). 

Thus, both sides of (5.41) satisfy the same ODE and the same initial condition. 
From the uniqueness of the solution, they should be the same. We have obtained 
the following theorem. 

Theorem 5.1. For any point x e M, there exists a differentiable map a : MxM -» 
M such that 

(i) er(0, x) = x; 

(ii) r i-> cr(r, x) is a solution of (5.40a) and (5.40b); and 



(iii) (j (Y, o^is, x)) — a{t + s, x). 


[Note: We denote the initial point by x instead of xq to emphasize that er is a map 
M x M -> M .] 

We may imagine a flow as a (steady) stream flow. If a particle is observed at 
a point x at t — 0, it will be found at a(t, x) at later time t. 

Example 5.9. Let M = IR 2 and let X((x, y)) = — y d/dx + xd/dy be a vector 
field in M. It is easy to verify that 

a(t, (x, >’)) = (x cos t — y sin f , x sin t + y cos t) 

is a flow generated by X. The flow through (x, y) is a circle whose centre is at 
the origin. Clearly, er(f, (x, y)) = (x, y) if t — 2nn, n e Z. If (x, y) = (0, 0), 
the flow stays at (0, 0). 

Exercise 5.7. Let M — R 2 , and let X = y d/dx + xd/dy be a vector field in M. 
Find the flow generated by X. 

5.3.1 One-parameter group of transformations 

For fixed t e ffi, a flow a it, x) is a diffeomorphism from M to M , denoted by 
o> : M M. It is important to note that a, is made into a commutative group by 
the following rules. 

(i) 07 (< 7 j(x)) = o> +s (x), that is, o, o a s = o t+s \ 

(ii) cro = the identity map (= unit element); and 

(iii) o- t = 

This group is called the one-parameter group of transformations. The 

group locally looks like the additive group ffi, although it may not be isomorphic 
to M globally. In fact, in example 5.9, ojnn+t was the same map as a, and we find 
that the one -parameter group is isomorphic to SO(2), the multiplicative group of 
2x2 real matrices of the form 


( cos 9 — sin0 

sint? cost? 

or U(l), the multiplicative group of complex numbers of unit modulus e.'" . 

Under the action of a E , with an infinitesimal e, we find from (5.40a) and 
(5.40b) that a point x whose coordinate is x 1 ' is mapped to 

cr' l {x) = a^(s, x) = x^ + eX^(x). (5.42) 


The vector field X is called, in this context, the infinitesimal generator of the 
transformation a, . 



Given a vector field X, the corresponding flow a is often referred to as the 
exponentiation of X and is denoted by 


= exp (tX)x^. 


(5.43) 


a 


The name ‘exponentiation’ is justified as we shall see now. Let us take a parameter 
t and evaluate the coordinate of a point which is separated from the initial point 
x — er(0, x) by the parameter distance t along the flow a. The coordinate 
corresponding to the point a(t, x ) is 


2 


f2 

s =o + 2! 




a^(s,x) +■•■ 


.5=0 



(5.44) 


The last expression can also be written as a^{t, x) — exp (tX)x^, as in (5.43). 
The flow a satisfies the following exponential properties. 




(iii) a(t, a(s, x)) — cr(f, exp(i'Z)x) = exp(tX) exp(sX)x 


= exp{(r + s)X}x = cr(r + 5 , x). (5.45c) 


5.3.2 Lie derivatives 

Let a{t, x) and r(t, x) be two flows generated by the vector fields X and Y, 



(5.46a) 


(5.46b) 


Let us evaluate the change of the vector field Y along a (s, x ). To do this, we have 
to compare the vector Y at a point x with that at a nearby point x' = cr e (x), 
see figure 5.12. However, we cannot simply take the difference between the 
components of Y at two points since they belong to different tangent spaces 
T p M and T„ r (A ; M ; the naive difference between vectors at different points is 
ill defined. To define a sensible derivative, we first map Y \ a ( X ) to T X M by 
(er_ £ )* : T ae ( X )M — > T X M, after which we take a difference between two vectors 
(cr_ £ )* Y\ a t x \ and T| v , both of which are vectors in T X M . The Lie derivative of 
a vector field Y along the flow a of X is defined by 



(5.47) 



Figure 5.12. To compare a vector Y\ x with Y \a e ( x ), the latter must be transported back to 
x by the differential map (o_ e )*. 


Exercise 5.8. Show that is also written as 

C X Y = lim -[FI* - (o- e )*F| (7 _ e( * ) ] 

s — ^0 G 

= lim ~[Y\a e (x) - (<*e)*Y\x]- 

£ — >0 G 

Let ( U , i p) be a chart with the coordinates x and let X = X^d/dx 11 and 
Y — Y^d/'dx 1 ' be vector fields defined on U. Then oy (x ) has the coordinates 
x ^ + eX^{x) and 


Y\ ae(x) = Y»(x v + eX v (.x))e ll \ x + e x 

~ [Y»{x) + eX^(x)d v Y^(x)]e li \ x+eX 

where {e^} — [d/dx^] is the coordinate basis and 3 y = d/dx v . If we map this 
vector defined at er £ (x) to x by (a_ £ )*, we obtain 

[Y»(x) + eX x (x)d l Y> i (x)]d fl [x v - eX v {x)]e v \ x 

= [Y^(x) + eX\x)d xY^ixml - ed, L X v (x)]e v \ x 
= Y^(x)e„.\ x + e[^(x)3 M F y (x) - Y^x) d„X v (x)]e v \ x + 0(e 2 ). 

(5.48) 


From (5.47) and (5.48), we find that 


C X Y = (X% Y v - 3nX v )e v . 


(5.49a) 



Exercise 5.9. Let X — X^d/dx 11 and Y — Y^d/dx^ be vector fields in M. 
Define the Lie bracket [X, Y] by 

[X, Y]f = X[Y[f]\ - Y[X[f]] (5.50) 

where / e 'J(M). Show that [X . Y] is a vector field given by 

(X ll 8 ll Y v - 

This exercise shows that the Lie derivative of Y along X is 

CxY = [X, Y], (5.49b) 

[Remarks: Note that neither XY nor YX is a vector field since they are second- 
order derivatives. The combination [ X . Y] is, however, a first-order derivative and 
indeed a vector field.] 

Exercise 5.10. Show that the Lie bracket satisfies 

(a) bilinearity 


[X, ciYi + c 2 Y 2 ] = ci[X, Ti] + c 2 [X, Y 2 ] 

IciXi + c 2 X 2 , Y] = C1 [X U Y] + c 2 [X 2 , Y] 

for any constants ci and c 2 , 

(b) skew-symmetry 

[X, Y] - -[YX] 

(c) the Jacobi identity 

[[X, Y ], Z] + [[Z, X], Y] + [[Y, Z], X] = 0. 

Exercise 5.11. (a) Let X,Y e X(M) and / e 'J(M ). Show that 

C fx Y =f[X, Y] - Y[f]X (5.51a) 

£.x(fY)=f[X,Y] + X[f]Y. (5.51b) 

(b) Let X,Ye X(M) and / : M — »■ N. Show that 

MX, Y] = [UX, UY]. (5.52) 

Geometrically, the Lie bracket shows the non-commutativity of two flows. 
This is easily observed from the following consideration. Let cr(s, x) and r(f, x) 
be two flows generated by vector fields X and Y, as before, see figure 5.13. If we 
move by a small parameter distance s along the flow a first, then by S along r, 
we shall be at the point whose coordinates are 

r M (<5, cr(e, x)) ~ r ^(<5, x v + sX v (x)) 

~ x ;t + sZ^(x) + SY lx (x v + eX v (x )) 

~ x M + eX ll (x) + 8Y»(x) + s8X v (x)d v Y v (x). 


o(e, t(<8, x)) 


T (t, X) 


t( 6, x ) 



a(s, x) 


Figure 5.13. A Lie bracket [X, Y] measures the failure of the closure of the parallelogram. 


If, however, we move by 8 along r first, then by e along a, we will be at the point 

cr^te, r(S, x )) — cr M (e, x v + 8Y v (x)) 

~ x^ + SY^ix) + eX IAj (x v + 8Y v (x )) 

~ x^ + 8Y^{x) + eX^ix) + s8Y v (x)d v X ll (x). 


The difference between the coordinates of these two points is proportional to the 
Lie bracket, 

r *(8, cr(s, x)) - a^ie, r («5, x)) = s8[X, F] M . 


The Lie bracket of X and Y measures the failure of the closure of the 
parallelogram in figure 5.13. It is easy to see CxY = [X, Y] = 0 if and only 
if 

G(s,r(t,x)) — x(t,a(s,x)). (5.53) 


We may also define the Lie derivative of a one-form a> e QUM) along 
X e KM) by 

C x co = lim -[(ae)*co\ aAx) - co\ x ~\ (5.54) 

s — ^0 £ 

where co\ x e T*M is cx> at x. Put co = oj^dx 11 . Repeating a similar analysis as 
before, we obtain 


{o £ )*(o \ as ( x ) = &), x (x) dx'' + e[X v (x)d v co, l (x) + 9 /x X v (x)at v (x)] dx'' 
which leads to 

Cxco = ( X l ’d v co p + d IL X v M v ) dx'". (5.55) 

Clearly Cxco e T*(M), since it is a difference of two one-forms at the same point 


x. 



The Lie derivative of / e “S(M ) along a flow a s generated by a vector field 

X is 

C x f = lim ~[f(a e (x)) - f(x )] 

g — ^0 £ 

= lim -[fix' 1 + eX^fx)) - f(x^)] 

s — >0 £ 

= = XI ft (5.56) 

dx^ 

which is the usual directional derivative of f along X. 

The Lie derivative of a general tensor is obtained from the following 
proposition. 

Proposition 5.1. The Lie derivative satisfies 

£x(h + h) = C-xh + Gxti (5.57a) 

where t\ and ti are tensor fields of the same type and 

Cx(t\ <8> L) = (£xh) ® t 2 + t\ ® ( Lxh ) (5.57b) 

where t\ and ti are tensor fields of arbitrary types. 

Proof, (a) is obvious. Rather than giving the general proof of (b), which is full 
of indices, we give an example whose extension to more general cases is trivial. 
Take Y e X(M) and co e Q}(M) and construct the tensor product Y <8> co. Then 
(Y <g> co)\ ae(x) is mapped onto a tensor at x by the action of (cr_ £ )* <g> (er £ )*: 

[(«■-«)* ® (eL)*](X ® co)\<, s (x) = [(tr_ e )*F <8> (c e )*a>i\x- 

Then there follows (the Leibnitz rule): 

C X (Y ®co)= lim -[{(ct_ £ )*F <g> (cr £ )*tt>}| x - (Y ® co)\ x ] 

£ — ^0 £ 

= lim -[(er_ £ )*F <g> {(cr £ )*tt> - co} + {(ct_ £ )*F - Y} <g> co] 

G — >0 £ 

= Y <g> (Cxco) + (CxY) ® co. 

Extensions to more general cases are obvious. □ 

This proposition enables us to calculate the Lie derivative of a general tensor 
field. For example, let t — t^ v dx^ ® e v e T\(M). Proposition 5.1 gives 

C x t = X[t/] dx ,x <g> e v + t/iCx d-v' 4 ) ® e v + t/ dx^ <g> (C x e v ). 

Exercise 5.12. Let / be a tensor field. Show that 


A X,Y]t — GxGyt - CyCxt. 


(5.58) 



5.4 Differential forms 


Before we define differential forms, we examine the symmetry property of 
tensors. The symmetry operation on a tensor co e T° r p (M) is defined by 

Pco(V 1, . . . , Vr) = co(Vp ( 1), . . . , V P(r) ) (5.59) 

where Vj e T p M and P is an element of S r , the symmetric group of order r. 
Take the coordinate basis {e p } = {d/dx^}. The component of co in this basis is 

e^ 2 , . . . , e Pr ) — a>p lf j, 2 ...p, r . 

The component of Poo is obtained from (5.59) as 

Pwiefn , e P2 , . . . , e Pr ) — t^iip^n-Pcio-.-ppij) • 

For a general tensor of type (q, r), the symmetry operations are defined for q 
indices and r indices separately. 

For co e 7? p (M), the symmetrizer § is defined by 

Sco — — y Pco (5.60) 

Y ! t ^ 

' PeS r 

while the anti-symmetrizer A is 

1 \ - 

Aco — — } sgn (P)Pco (5.61) 

Y ' ‘ ^ 

' PeS r 

where sgn(P) = +1 for even permutations and —1 for odd permutations. § oo is 
totally symmetric (that is, P Sco — Sco for any P e S r ) and Aoo is totally anti- 
symmetric ( PAco — sgn(P)/l&>). 


5.4.1 Definitions 


Definition 5.4. A differential form of order r or an r-form is a totally anti- 
symmetric tensor of type (0, r). 


Let us define the wedge product A of r one-forms by the totally anti- 
symmetric tensor product 

d.r ,ll Ad.r' l2 A...Ad/' = ^ sgn(B) dx Mp w Adx Mi> < 2 > A. . . Adx^w. (5.62) 

PeS r 


For example. 


dx ^ A dx y = dx /J <g> dx v — dx v <g> dx /x 
dx 1 A dx ^ A dx v = dx* <g> dx M <g> dx v + dx v <g> dx* ® dx /x 

+ dx M <g> dx v ® dx* — dx x <g> dx v ® dx 11 
— dx v <g> dx 1 * <S> dx* — dx M <g> dx* <g> dx v . 



It is readily verified that the wedge product satisfies the following. 

(i) dx Ml A . . . A dx^ r = 0 if some index ji appears at least twice. 

(ii) dx Ml A ... A Ax^ r = sgn(P) dx^Ai) A ... A 

(iii) dx /J| A ... A dx ,lr is linear in each dxP. 

If we denote the vector space of r-forms at p e M by Q r p (M), the set of 
/■-forms (5.62) forms a basis of Q,' p (M) and an element co e Q' p (M) is expanded 
as 

co — — co PlP2 ... Pr dx w A dx M2 A ... A dx llr (5.63) 

where co plP2 ... Pr are taken totally anti-symmetric, reflecting the anti-symmetry 
of the basis. For example, the components of any second-rank tensor oo p v are 
decomposed into the symmetric part a pv and the anti-symmetric part a pv : 

cifx V —( 0(^1 >) = (5.64a) 

= (0[pv] = (5.64b) 

Observe that a pv dx ^ A dx v = 0, while a pv dx ^ A dx y = co pv dx^ L A dx y . 

Since there are (”') choices of the set (fi\, /r 2, ■ . . , p, r ) out of (1,2,..., m) 
in (5.62), the dimension of the vector space Q' p ( M ) is 

( m\ ml 

r J (m — r)\r\ 

For later convenience we define £2° p (M) — K. Clearly £2 l p (M) — T*M. If 
/■ in (5.62) exceeds m, it vanishes identically since some index appears at least 
twice in the anti-symmetrized summation. The equality (”') = ( ™ ) implies 
dim Q' p (M) — dim Q' p ~'(M). Since Q. r p {M) is a vector space, £l' p (M) is 
isomorphic to Q,' p ~ r (M ) (see section 2.2). 

Define the exterior product of a //-form and an / -form A : Q q p (M ) x 
Q r p (M) -> (M) by a trivial extension. Let co e Q, q p (M) and f e C2' p (M), 

for example. The action of the (q + rf-form co A £ on q + r vectors is defined by 


(CO A $)(Vi,.. 

1 


y, sgn(P)ft/(yp(i), . . . , Vp( q ))%{Vp( q+ 1), . . . , Vp( q+r} ) 


where V) e T p M. If q + r > m, co A £ vanishes identically. With this product, 
we define an algebra 

Q* p (M) = Q? p (M) ffi £2 l p (M) © . . . ® «™(Af). 


(5.66) 



Table 5.1. 


/■-forms 

Basis 

Dimension 

Q°(M) = F{M) 

11} 

1 

Q l (M ) = T*M 

{dx* 2 } 

m 

Qr(M) 

{dx^i A dx^ 2 } 

m(m — l)/2 

Q. 3 (M) 

{dv^ 1 A dx^ 2 A dv /i3 } 

m(m — 1 )(m — 2) / 6 

Q. m (M) 

(dr 1 A dr 2 A . . . dx m ) 

1 


£2* p (M) is the space of all differential forms at p and is closed under the exterior 
product. 

Exercise 5.13. Take the Cartesian coordinates (x, y) in M 2 . The two-form dx Ady 
is the oriented area element (the vector product in elementary vector algebra). 
Show that, in polar coordinates, this becomes rdr A AO. 

Exercise 5.14. Let§ e £2p(M), i] e Q p (M) and co e Q p (M). Show that 

f A £ = 0 if q is odd (5.67a) 

(Ai) = (-lfi)A? (5.67b) 

(§ A rj) A co — f A (/; A &>). (5.67c) 

We may assign an r- form smoothly at each point on a manifold M. We 
denote the space of smooth r -forms on M by Q r ( M ). We also define Q (> ( M ) to 
be the algebra of smooth functions, T(M). In summary we have table 5.1. 

5.4.2 Exterior derivatives 

Definition 5.5. The exterior derivative d r is a map £}' (M) -» £2 r+l (M) whose 
action on an r-form 

CO = —COfn.,.^ dx Ml A ... A dx ,tr 

is defined by 


d ,co = 


dx 1 


-co, 


M1-/V 


dx v A dx ^ 1 A ... A dx flr . 


(5.68) 


It is common to drop the subscript r and write simply d. The wedge product 
automatically anti-symmetrizes the coefficient. 



Example 5.10. The / -forms in three-dimensional space are: 


(i) coo = fix, y, z), 

(ii) an = a> x (x, y, z) dx + co y (x, y, z) dy + a> z (x, y, z) dz, 

(iii) (02 = a> xy (x, y, z) dx A dy + co yz (x , y , z) d v A dz + a> zx (x, y, z) dz A dx 

and 

(iv) co 3 = co xyz (x, y, z) dx A dy A dz. 

If we define an axial vector a 11 by eP' vl co v \, a two-form may be regarded as a 
‘vector’ . The Levi-Civita symbol g /ivA j s defined by S P <OP(2)PO) _ S g n (p) and 
provides the isomorphism between X(M) and Q 2 (M). [Note that both of these 
are of dimension three.] 

The action of d is 


3 f df 3 / 

(t) dcoo = — dx + — dy + — dz, 
dx dy dz 

{ 3 <o y dco x 


(ii) d«i = — ) dx A dy + ( ^ 

' da> x 3 a> z 


/ 3 a> z 


dz dx 
dco 


(iii) d&>2 = 

(iv) dtt >3 = 0 


\ dx dy 
dz A dx , 
dco 7X dco 


V dy 


yz 


dx 


+ 


dy 


+ 


•xy 


dz 


dx A dy A dz and 



dy A dz 


Hence, the action of d on a>o is identified with ‘grad’, on co\ with ‘rot’ and on a >2 
with ‘div’ in the usual vector calculus. 


Exercise 5.15. Let£ e £l q (M) and co e Q r (M). Show that 

d(f Aft.) = d£ AW+ (-l) 9 f A dm. (5.69) 

A useful expression for the exterior derivative is obtained as follows. Let us 
take X = X^d/dx 11 , Y = Y v d/dx v e X(M) and co = oj, a dx' 1 e £2 l (M). It is 
easy to see that the combination 

X[co(Y)] - Y[co(X)} - w([X, Y]) = ^(X v V M - X M Y V ) 
is equal to dco(X, Y), and we have the coordinate-free expression 

dco (X, Y) = X[co{Y)] - Y[co(X)] - co([X, T]). (5.70) 

For an r-form co e Q r (M), this becomes 
dco(X u ...,X r+l ) 

r 

= iy +1 xMXi’- x r+1 ) 

i=t 

+ Y. ( - ])i+J ° j([X ” Xi, . . . , Xj, . . . , Xj , . . . , X r+l ) (5.71) 

i<j 



where the entry below " has been omitted. As an exercise, the reader should 
verify (5.71) explicitly for r = 2. 

We now prove an important formula: 

d 2 = 0 (or d / + i d r = 0). (5.72) 


Take 


= —(o p d a- ^ 1 A ... A dx^ r e £2 r (M). 


The action of d~ on co is 


dr co = - 3 dx x A dx v A d.r Ml A ... A dx^ . 

r\ dx x dx v 


This vanishes identically since d 2 co Pl ... pLr /dx x dx v is symmetric with respect to X 
and v while dx A dx v is anti-symmetric. 


Example 5.1 1. It is known that the electromagnetic potential A — (0, A) is a 
one-form, A = A fl dx 11 (see chapter 10). The electromagnetic tensor is defined 
by F = dA and has the components 


( o -E x —E y -E x \ 

E x 0 B- -B y 

Ey —B z 0 B x 

\ E Z By — By 0 J 


(5.73) 


where 

9 

E = — V0 -A and B = V x A 

9v u 

as usual. Two Maxwell equations, V ■ B = 0 and 9B /dt = —V x E follow from 
the identity d F = d(dA) = 0, which is known as the Bianchi identity, while the 
other set is the equation of motion derived from the Lagrangian (1.245). 


A map / : M — > N induces the pullback /* : T* (p) N — > T*M and 
/* is naturally extended to tensors of type (0, r); see section 5.2. Since an 
r-form is a tensor of type (0, r). this applies as well. Let co e Q r (N) and 
let / be a map M -* N. At each point f(p) e N, f induces the pullback 
r : Q' /(/)1 ,Y •-> <y p M by 

(f*co)(X 1 , . . . , X r ) = coiUX 1 , . . . , f,X r ) (5.74) 

where X ; e T p M and /* is the differential map T p M — > Tf( p) N. 

Exercise 5.16. Let f , co e Q r ( N ) and let / : M N. Show that 


d (f*co) = f*(dco) 

/*« A 0>) = (/*£) A (/*©). 


(5.75) 

(5.76) 


The exterior derivative d r induces the sequence 

0 -4 Q°(M) £2*(M) ^ Q m (M) 0 (5.77) 

where i is the inclusion map 0 ^ This sequence is called the de Rham 

complex. Since d 2 = 0, we have imd,. C kerd r +i . [Take co e Q' (M). Then 
d r co e imd r and d r _|_i (d r cu) = 0 imply d r co e kerd /+ i.] An element of kerd, is 
called a closed r-form, while an element of irn d r _i is called an exact r-form. 
Namely, co e Q'(M) is closed if do) = 0 and exact if there exists an (r — l)-form 
i// such that co = di ]r. The quotient space kerd,/im d r -\ is called the rth de 
Rham cohomology group which is made into the dual space of the homology 
group; see chapter 6. 

5.4.3 Interior product and Lie derivative of forms 

Another important operation is the interior product ix : £i r (M) — > £2 r-1 (M), 
where X e X(M). For co e £2 r (Af), we define 

ixa>(Xu Xr-i) = co{X, Xu..., Xr-i). (5.78) 

For X = X /l 8/dx 11 and co — (1 /r^co^...^ dx M1 A ... A dx /lr we have 

ixoo = ^ X v co V u, 2 ..,n r dx ^ 2 A ... A dx^ 

1 r _ 

= — dx w A ... A dx lls A ... A dx'' 1 ' 

S= 1 

(5.79) 

where the entry below " has been omitted. For example, let (x, y, z) be the 
coordinates of M 3 . Then 

i ex (dx A dy) = dy , i ex (dy A dz) = 0, i ex (dz A dx) = — dz. 

The Lie derivative of a form is most neatly written with the interior product. 
Let co — co^ dx 1 ' be a one-form. Consider the combination 

(dix + ixd )co = d (X''&> ; J + ix[ j(d^oo v - 9,,^) dx'' A dx v ] 

= {co^dvX^ + X^dvCOft) dx v + X ll (d^ao v - d v co^) dx y 
= {co^dvX^ + X»d tl co v )dx v . 

Comparing this with (5.55), we find that 


£ x «o = (dix + ixd)«. 


(5.80) 


For a general /'-form to = (1 /r!)£w Jttl ... JU , I . dx / ' 1 A ... A dx^, we have 
= lim -((o E )*co\ ai{x) - wU) 

£ — >0 G 

= X v ^a v w Ml ... /Xr dx ,il A ... A dx Mr 

' i * 

+ J2 a ^ xV -^^.-x.:^dx^A...Adx^. (5.81) 

5 = 1 ’ 

We also have 
(dix + ixd)a> 

l r 
£=1 

X (— 1) J_1 d.T y A dx ^ 1 A ... A dx^s A dx^ 

+ -^[Z v a v <u A[ 1 ... /ir dx ,il A ... A dx ^ 
r 

+ J 2 x ^ C 0 ^-^-^r(- i ydx v A dx^ A ... A dx^ A ... A dx^] 
s= 1 

1 r 

= — '^ / [dvX^ s CO lll ...n. s ... tlr (— l) s_1 dx V A dx M1 A ... A dx^ 1 A ... A dx^ 

s= 1 

+ ^yX y a y tt» M1 ... lUr dx w A ... A dx /tr . 

If we interchange the roles of /x v and v in the first term of the last expression and 
compare it with (5.81), we verify that 

(dix + ix d)a> = Cxto (5.82) 

for any r-fonn to. 

Exercise 5.17. Let X. Y e X(M) and to e Q r (M). Show that 

i[X,F]tt> = X(iyco) — Y (ixw). (5.83) 

Show also that ix is an anti-derivation, 

ix(tt> A rf) = ix« A /; + (— 1 ) r u> A ixrj (5.84) 

and nilpotent, 

i 2 x = 0. (5.85) 

Use the nilpotency to prove 


£xixtt> = ix£x«- 


(5.86) 



Exercise 5.18. Let t e 7 n m (M). Show that 


+ J2 d Vs X't^X - £ dxX^ . (5.87) 

5=1 5 = 1 

Example 5.12. Let us reformulate Hamiltonian mechanics (section 1.1) in terms 
of differential forms. Let H be a Hamiltonian and (q 11 , p /; ) be its phase space. 
Define a two-form 

co = dp fl A d <7^ (5.88) 

called the symplectic two-form. If we introduce a one-form 

e = q 11 d Ptl , (5.89) 

the symplectic two-form is expressed as 

co = dO. (5.90) 


Given a function / ( q , p) in the phase space, one can define the Hamiltonian 
vector field 

9/9 9/9 

X f = 1 X-—- ir L 7 - — . (5.91) 


9/fy dq V dq^ dp^ 


Then it is easy to verify that 


3 / , a 3 / 

1 x f co = -— d P 


— d = -df. 

9 Pli dq» 


Consider a vector field generated by the Hamiltonian 

8H 9 dH 9 

X H — 


dpn dqV dq^ L dp^ 

For the solution (q 11 . p^) to Hamilton’s equation of motion 

d q^ dH dpn 

dr dp^ dr 

we also obtain 


8H 
dqL 1 ’ 


X H — 


dp^ 9 d 9 d 

dr dp M d t dqv- dr 


(5.92) 


(5.93) 


(5.94) 


The symplectic two-form co is left invariant along the flow generated by Xh. 


£x h oj — d(i x H a>) + \x H (dw) 

= d(i x H cS) = -d 2 H = 0 (5.95) 

where use has been made of (5.82). Conversely, if X satisifes Cx(» = 0, there 
exists a Hamiltonian H such that Hamilton’s equation of motion is satisfied 



along the flow generated by X. This follows from the previous observation that 
Cxoj = d(ix&>) = 0 and hence by Poincare’s lemma, there exists a function 
H (q , p) such that 

i x co = — d H. 

The Poisson bracket is cast into a form independent of the special coordinates 
chosen with the help of the Hamiltonian vector fields. In fact, 


ix f (ix g co) = -i Xf (dg) = 


dq M dPn 


dq V dp p 


If, g]PB- 


(5.96) 


5.5 Integration of differential forms 
5.5.1 Orientation 

An integration of a differential form over a manifold M is defined only when 
M is ‘orientable’. So we first define an orientation of a manifold. Let M be 
a connected m -dimensional differentiable manifold. At a point p e M, the 
tangent space T p M is spanned by the basis {e^} = {3/3x M }, where x ^ is the 
local coordinate on the chart U ,■ to which p belongs. Let Uj be another chart such 
that U, ni/,^0 with the local coordinates y a . If p e Uj fl Uj, T p M is spanned 
by either {e p } or {io.} = { 3/3 v“ }. The basis changes as 



If J — det(3x /i /3 y a ) > 0 on (7/ fl Uj, {e p } and { 7 a } are said to define the same 
orientation on t/; fl Uj and if J < 0, they define the opposite orientation. 

Definition 5.6. Let M be a connected manifold covered by {(/, }. The manifold 
M is orientable if, for any overlapping charts Ui and Uj, there exist local 
coordinates {x^} for U, and {y“} for Uj such that J — det (dx^/dy 01 ) > 0. 

If M is non-orientable, J cannot be positive in all intersections of charts. 
For example, the Mobius strip in figure 5.14(a) is non-orientable since we have 
to choose J to be negative in the intersection B. 

If an m -dimensional manifold M is orientable, there exists an m-form u> 
which vanishes nowhere. This w-form o> is called a volume element, which 
plays the role of a measure when we integrate a function / e 3 (M ) over M. 
Two volume elements co and of are said to be equivalent if there exists a strictly 
positive function li e 3 (M) such that a> = hco' . A negative-definite function 
W e 3(M ) gives an inequivalent orientation to M. Thus, any orientable manifold 
admits two inequivalent orientations, one of which is called right handed, the 
other left handed. Take an w-form 


co — h(p) dx 1 A . . . A dx" ! 


(5.98) 



Figure 5.14. (a) The Mobius strip is obtained by twisting the part B' of the second 
strip by n before pasting A with A' and B with B'. The coordinate change on B is 
y 1 = x 1 , y~ = —x~ and the Jacobian is —1. (b) Basis frames on the Mobius strip. 


with a positive-definite hip) on a chart (U, (p) whose coordinate is x — (pip). 
If M is orientable, we may extend o> throughout M such that the component 
h is positive definite on any chart (/;. If M is orientable, this co is a volume 
element. Note that this positivity of h is independent of the choice of coordinates. 
In fact, let p e U ,• fi £7/^0 and let x M and y a be the coordinates of Uj and Uj, 
respectively. Then (5.98) becomes 

9 ( dx^\ 

co = h{p) dv^ 1 A ... A dy^ 7 " = hip) det ( ) dv 1 A . . . A dy m . 

9y/il 9yMm ' y dy v J 

(5.99) 

The determinant in (5.99) is the Jacobian of the coordinate transformation and 
must be positive by assumed orientability. If M is non-orientable, co with a 
positive-definite component cannot be defined on M. Let us look at figure 5.14 
again. If we circumnavigate the strip along the direction shown in the figure, 
co = dx A dy changes the signature dx A dy -> — dx A dy when we come back to 
the starting point. Hence, co cannot be defined uniquely on M. 


5.5.2 Integration of forms 

Now we are ready to define an integration of a function / : M — > IT over an 
orientable manifold M. Take a volume element co. In a coordinate neighbourhood 
Uj with the coordinate x, we define the integration of an m-form fco by 



fiq>i \x))hi<p: 1 (x))dx 1 ...dx m . 


(5.100) 



The RHS is an ordinary multiple integration of a function of m variables. Once 
the integral of / over U, is defined, the integral of / over the whole of M is given 
with the help of the ‘partition of unity’ defined now. 

Definition 5.7. Take an open covering {[/,} of M such that each point of M is 
covered with a finite number of f/;. [If this is always possible, M is called 
paracompact, which we assume to be the case.] If a family of differentiable 
functions sfip) satisfies 

(i) 0 < sfip) < 1 

(ii) St (p) — 0 if p £ Ui and 

(iii) £i ( p) + e 2 (p) + . . . = 1 for any point p e M 

the family {£(/?)} is called a partition of unity subordinate to the covering {[/,}. 


From condition (iii), it follows that 

/(F) = /(F)e/F) = /'(F) (5-101) 

i i 


where f] (p) = f(p)si(p) vanishes outside Ui by (ii). Hence, given a point 
p e M, assumed paracompactness ensures that there are only finite terms in the 
summation over i in (5.101). For each f, (p), we may define the integral over U j 
according to (5.100). Finally the integral of / on M is given by 



(5.102) 


Although a different atlas {(Vi, fit)} gives different coordinates and a different 
partition of unity, the integral defined by (5.102) remains the same. 

Example 5.13. Let us take the atlas of S l defined in example 5.2. Let U \ = 
S l - {(1, 0)}, U 2 = S x - {(-LO)}, £i(6>) = sin 2 (0/2) and s 2 (0) = cos 2 (6»/2). 
The reader should verify that {£;($)} is a partition of unity subordinate to {Ui}. 
Let us integrate a function / = cos 2 6, for example. [Of course we know 


p27T 

Jo ' 


A0 cos" 0 = 7T 


but let us use the partition of unity.] We have 


[ A0 cos 2 0=[ i 

Js l Jo 


A0 sin 2 — cos 2 0 + 


f 


A6 cos — cos" 
2 


— X. 1-rr — 

— 2 7T ~r 2 ^ — 7T. 


So far, we have left h arbitrary provided it is strictly positive. The reader 
might be tempted to choose h to he unity. However, as we found in (5.99), h 
is multiplied by the Jacobian under the change of coordinates and there is no 
canonical way to single out the component h ; unity in one coordinate might not 
be unity in the other. The situation changes if the manifold is endowed with a 
metric, as we will see in chapter 7. 


5.6 Lie groups and Lie algebras 

A Lie group is a manifold on which the group manipulations, product and inverse, 
are defined. Lie groups play an extremely important role in the theory of fibre 
bundles and also find vast applications in physics. Here we will work out the 
geometrical aspects of Lie groups and Lie algebras. 

5.6.1 Lie groups 

Definition 5.8. A Lie group G is a differentiable manifold which is endowed with 
a group structure such that the group operations 

(i ) • : G x G -» G, (gi, gi) gi • g 2 

(ii) ~ l :G-> G, gv+ g~ l 

are differentiable. [Remark: It can be shown that G has a unique analytic structure 
with which the product and the inverse operations are written as convergent power 
series.] 

The unit element of a Lie group is written as e. The dimension of a Lie group 
G is defined to be the dimension of G as a manifold. The product symbol may be 
omitted and gi • g 2 is usually written as gig 2 - For example, let IR* = M— {0}. Take 
three elements x, y.z € TO* such that xy = z. Obviously if we multiply a number 
close to x by a number close to y, we have a number close to z. Similarly, an 
inverse of a number close to x is close to I /x. In fact, we can differentiate these 
maps with respect to the relevant arguments and M* is made into a Lie group with 
these group operations. If the product is commutative, namely gig 2 = g 2 gi, we 
often use the additive symbol + instead of the product symbol. 

Exercise 5. 1 9. 

(a) Show that K + = {x e M| jc > 0} is a Lie group with respect to 
multiplication. 

(b) Show that E is a Lie group with respect to addition. 

(c) Show that M 2 is a Lie group with respect to addition defined by (xi , y i ) + 
(*2, yi) = (At + X 2 , y i + yi)- 

Example 5.14. Let S' 1 be the unit circle on the complex plane, 

S 1 = {e ie \0eR (mod 2 n)}. 

The group operations defined by e^e 1 ^ = and (e l6l ) _1 = e -10 are 

differentiable and .S' 1 is made into a Lie group, which we call U(l). It is easy 
to see that the group operations are the same as those in exercise 5. 19 (ft) modulo 
2jt. 


Of particular interest in physical applications are the matrix groups which 
are subgroups of general linear groups GL(«, ffi) or GL(n, C). The product of 



elements is simply the matrix multiplication and the inverse is given by the matrix 
inverse. The coordinates of GL (n, M) are given by nr entries of M — {x,y}. 
GL(«. K) is a non-compact manifold of real dimension n 2 . 

Interesting subgroups of GL (n. K) are the orthogonal group O(n), the 
special linear group SL(«, IK) and the special orthogonal group SO(h): 

O(n) = {M e GL(n, W)\MM l = M l M = /„} (5.103) 

SL(n, IK) = {M e GL (n, ffi)| det M = 1} (5.104) 

SO(n) = 0(n) IT SL(n, IK) (5.105) 

where ( denotes the transpose of a matrix. In special relativity, we are familiar 

with the Lorentz group 

0(1, 3) = {M e GL(4, K)| MrjM' = ;/} 

where i] is the Minkowski metric, i] = diag(— 1, 1, L 1)- Extension to higher- 
dimensional spacetime is trivial. 

Exercise 5.20. Show that the group 0(1,3) is non-compact and has four 
connected components according to the sign of the determinant and the sign of the 
(0, 0) entry. The component that contains the unit matrix is denoted by Oj_(L 3). 

The group GL(«, C) is the set of non-singular linear transformations in C" , 
which are represented by n x n non-singular matrices with complex entries. The 

unitary group U(n), the special linear group SL(u, C) and the special unitary 
group SU(«) are defined by 

U («) = {M e GL(«, Q| MM f = M f M = 1} (5.106) 

SL(«, C) = {M e GL (n, Q| det M = 1} (5.107) 

SU(n) = U(n) n SL(n, C) (5.108) 

where ' is the Hermitian conjugate. 

So far we have just mentioned that the matrix groups are subgroups of a Lie 
group GL(h, IK) (or GL(n, C)). The following theorem guarantees that they are 
Lie subgroups, that is, these subgroups are Lie groups by themselves. We accept 
this important (and difficult to prove) theorem without proof. 

Theorem 5.2. Every closed subgroup H of a Lie group G is a Lie subgroup. 

For example, O(n), SL(n, IK) and SO(n) are Lie subgroups of GL(n, IK). To 
see why SL(«, IK) is a closed subgroup, consider a map / : GL(n, IK) — > IR 
defined by A i-> det A. Obviously / is a continuous map and / -1 ( 1) = 
SL(n, IK). Apoint{l} is a closed subset of IK, hence / -1 (1) is closed in GL(n, IK). 
Then theorem 5.2 states that SL(«. IK) is a Lie subgroup. The reader should verify 
that O(n) and SO(n) are also Lie subgroups of GL(«, IK). 



Let G be a Lie group and H a Lie subgroup of G. Define an equivalence 
relation ~ by g ~ g' if there exists an element h e H such that g' = gh. An 
equivalence class [g] is a set {gh\h e H}. The coset space G /II is a manifold (not 
necessarily a Lie group) with dim G/H = dim G — dim H . G/H is a Lie group if 
H is a normal subgroup of G, that is, if ghg~ l e H for any g e G and h e H . In 
fact, take equivalence classes [g], [g'] e G/H and construct the product [g][g']. 
If the group structure is well defined in G/H , the product must be independent 
of the choice of the representatives. Let gh and g'h! be the representatives of [g] 
and [g'] respectively. Then ghg'h! = gg'h"b! e [gg'] where the equality follows 
since there exists h" e H such that hg' = g'h" . It is left as an exercise to the 
reader to show that [g] — 1 is also a well defined operation and [g] -1 = [g -1 ]. 


5.6.2 Lie algebras 

Definition 5.9. Let a and g be elements of a Lie group G. The right-translation 
R„ : G -» G and the left-translation L a : G — >• G of g by a are defined by 


Rag =ga (5.109a) 

L a g =ag. (5.109b) 

By definition, R a and L a are diffeomorphisms from G to G. Hence, the 
maps L a : G — > G and R a : G -> G induce L n , t . : T g G -> T ag G and 
Ra* ■ T g G — > Tg a G ; see section 5.2. Since these translations give equivalent 
theories, we are concerned mainly with the left-translation in the following. The 
analysis based on the right-translation can be carried out in a similar manner. 

Given a Lie group G, there exists a special class of vector fields characterized 
by an invariance under group action. [On the usual manifold there is no canonical 
way of discriminating some vector fields from the others.] 


Definition 5.10. Let X be a vector field on a Lie group G. X is said to be a left- 

invariant vector field if L a *X\ g = X\ ag . 

Exercise 5.21. Verify that a left-invariant vector field X satisfies 


L a *X\g = X^(g) 


3 x v (ag) d 
dx^(g) dx v 


= X v (ag) 


ag 


dx v 


(5.110) 


ag 


where x^(g) and x^iag) are coordinates of g and ag, respectively. 


A vector V e T e G defines a unique left-invariant vector field Xy throughout 
G by 

X V \ g = L g *V geG. (5.111) 

In fact, we verify from (5.34) that Xv\ ag — L ag *V = (L fl L ? )*V = L a *Lg*V = 
L a *X\/\g. Conversely, a left-invariant vector field X defines a unique vector 
V = X\ e e T e G. Let us denote the set of left-invariant vector fields on G by 



g. The map T e G -> g defined by V m* Xy is an isomorphism and it follows 
that the set of left-invariant vector fields is a vector space isomorphic to T e G. In 
particular, dim g = dim G. 

Since g is a set of vector fields, it is a subset of X(G) and the Lie bracket 
defined in section 5.3 is also defined on g. We show that g is closed under the 
Lie bracket. Take two points g and ag = L a g in G. If we apply L a * to the Lie 
bracket [X, Y] of X, Y e g, we have 


L a *[X, Y]\ g = [L a *X \ g , L a *Y\g] = [X, Y\\ ag (5.112) 

where the left-invariances of X and Y and (5.52) have been used. Thus, [X, Y] e 
g, that is g is closed under the Lie bracket. 

It is instructive to work out the left-invariant vector field of GL(«, E). The 
coordinates of GL(«, E) are given by n 2 entries x IJ of the matrix. The unit 
element is e — I n = Let g — {x lJ (g)} and a — {x lJ (a)} be elements 

of GL(n, E). The left-translation is 

L ag = ag = Y2 x ' k (a)x kj (g)- 


Take a vector V = V 1 ^ d/dx 1 ^ \ e e T e G where the V'j are the entries of V . The 

left-invariant vector field generated by V is 


X v \ g = L g ,V= Vij -£lj 


ijklm 


x kl (g)x lm (e) 


dx km 


9 


= T, vlj *"<*$*? to* 


= J2x ki (g)V ij =J2(S y ) 




; 9 


dx k i 


(5.113) 


where gV is the usual matrix multiplication of g and V . The vector Xy\ g is often 
abbreviated as gV since it gives the components of the vector. 

The Lie bracket of Xy and Xw generated by V — V'- 7 d/dx l i \ e and W — 
W^d/dx^U is 


[Xy, X w ]\ g = J2x ki (.gW ij 


dx k J 


x ca (g)W‘ 


ab 


dx cb 


( V W) 


= J2 xij (s)[V jk W kl - W jk v kl ] j-jj 


- w])lJ 


dx'- 1 


(5.114) 


Clearly, (5.113) and (5.114) remain true for any matrix group and we establish 
that 


L g * V — gV 

[Xv,X w ]| g =L g *[V,W]=g[V, W]. 


(5.115) 

(5.116) 



Now a Lie algebra is defined as the set of left-invariant vector fields g with 
the Lie bracket. 

Definition 5.11. The set of left-invariant vector fields g with the Lie bracket 
[ , ] : g x g — > g is called the Lie algebra of a Lie group G. 

We denote the Lie algebra of a Lie group by the corresponding lower-case 
German gothic letter. For example so(n) is the Lie algebra of SO (n). 

Example 5.15. 

(a) Take G — ffi as in exercise 5.19(b). If we define the left translation L a by 
xv- -+ x + a, the left-invariant vector field is given by X = d/dx. In fact, 

3 (a + x) 3 3 

dx 3 (a + x) 3 (x + a) 

Clearly this is the only left-invariant vector field on ffi. We also find that 
X — 3/3 6 is the unique left-invariant vector field on G — SO(2) = {e ie |0 < 
0 < 2jtj. Thus, the Lie groups ffi and SO(2) share the common Lie algebra. 

(b) Let gl(n, ffi) be the Lie algebra of GL (n, ffi) and c : (— e, e) — > GL (n. ffi) 

be a curve with c(0) = /„. The curve is approximated by c(s) — I n + s A + 
0(s 2 ) near ,v = 0. where A is an n x n matrix of real entries. Note that 
for small enough s, detcG) cannot vanish and c(s) is, indeed, in GL(n, ffi). 
The tangent vector to c(s) at /„ is c^s)] 0 = A. This shows that gl(n, ffi) 

is the set of n x n matrices. Clearly dimg[(«, ffi) = n 2 = dimGL(«, ffi). 
Subgroups of GL(«, ffi) are more interesting. 

(c) Let us find the Lie algebra si(«, ffi) of SL(n, ffi). Following this 
prescription, we approximate a curve through /„ bycG) = I n +sA+ 0(s 2 ). 
The tangent vector to cG) at l n is c'(s) | Q = A. Now, for the curve cG) to 
beinSL(«,ffi), cG) has to satisfy detcG) = 1 + 5-trA = 1, namely tr^4 = 0. 
Thus, 5 l(n, ffi) is the set of nxn traceless matrices and dim sUn, ffi) — n 2 — 1. 

(d) Let cG) = In + sA + 0(s 2 ) be a curve in SO («) through /„. Since 
cG) is a curve in SO («), it satisfies cG) t cG) — In- Differentiating this 
identity, we obtain c , G) t cG) + c(s) t c , G) = 0. At s = 0, this becomes 
A ( + A — 0. Hence, so («) is the set of skew-symmetric matrices. Since 
we are interested only in the vicinity of the unit element, the Lie algebra 
of O (n) is the same as that of SO (n): o(n) = so (n). It is easy to see that 
dimo(n) = dimso(n) = n{n — l)/2. 

(e) A similar analysis can be carried out for matrix groups of GL(n, C). 
g[(n, C) is the set of n x n matrices with complex entries and dim gl(«, Q = 
In 2 (the dimension here is a real dimension). s((n, C) is the set of traceless 
matrices with real dimension 2(w 2 — 1). To find u(«), we consider a 
curve cG) = I n + sA + 0(s 2 ) in U(n). Since c(syc(s) — we 
have c / G) t c(s) + cGl^c'G) = 0. At s = 0, we have A' + A = 0. 





Hence, u(n) is the set of skew-Hermitian matrices with dimu(n) = n 2 . 
su(n) = u (n) fl s!(«) is the set of traceless skew-Hermitian matrices with 
dimsu(n) — n 2 — 1. 


Exercise 5.22. Let 


cO) = 

be a curve in SO(3). Find the tangent vector to this curve at It,. 


cos s —sms 
sins cos s 
0 0 


5.6.3 The one-parameter subgroup 

A vector field X e X(M) generates a flow in M (section 5.3). Here we are 
interested in the flow generated by a left-invariant vector field. 

Definition 5.12. A curve 0 ■ ® — »■ G is called a one-parameter subgroup of G 
if it satisfies the condition 


0(f)0(s) = 0(f + s). (5.117) 

It is easy to see that 0(0) = e and 0 _1 (r) — 0(— r). Note that the curve 0 
thus defined is a homomorphism from M to G. Although G may be non- Abelian, 
a one-parameter subgroup is an Abelian subgroup: 0(t)0(s) = 0 (r + s) = 
0(s + 7) = 0(s)0(t). 

Given a one-parameter subgroup 0 : M -> G, there exists a vector field X, 
such that 

d0 /l ( t ) 

-^=^(0(0). (5.118) 

dr 

We now show that the vector field X is left-invariant. First note that the vector 
field d/df is left-invariant on TR, see example 5.15(a). Thus, we have 


d 

(L t )* — 
dr 


Next, we apply the induced map 0* : 7)1 
d/dt\ t , 


dr 


(5.119) 

70(r)G on the vectors d/dr |o and 


d 

0* T7 

dr 

d 

0 *d 1 


d0^(r) 


dr 

d0^(r) 


dr 


9 

o d § 11 
9 

t 3 


= X\ 
= X\ 


where we put 0 (?) = g . From (5.119) and (5 . 1 20b), we have 


d 

(0L r )* — 
dr 


— 0 * 7 .;*—— 

dr 


= X 


g- 


(5.120a) 

(5.120b) 

(5.121a) 



It follows from the commutativity <pL t — L g <p that (p. t L, t — L g *0*. Then 
(5.121a) becomes 


4>*Lr 


d t 




dr 


= Lg*X \, 


From (5.121), we conclude that 


(5.121b) 


L g *X\ e = X\ g . (5.122) 

Thus, given a flow (pit), there exists an associated left-invariant vector field 
X € g. 

Conversely, a left-invariant vector field X defines a one-parameter group of 
transformations cr(r, g) such that da(r, g)/dt — X and cr(0, g) = g. If we define 
4> : M — > G by (pit) = a it, e), the curve (pit) becomes a one-parameter subgroup 
of G. To prove this, we have to show (pis + t) = <pis)<pit). By definition, a 
satisfies 

d 

— ait, ois, e)) — Xiait, er(s, e))). (5.123) 

dr 

[We have omitted the coordinate indices for notational simplicity. If readers feel 
uneasy, they may supplement the indices as in (5.118).] If the parameter is fixed, 
ait, (pis)) = ( pis)(p(t ) is a curve M — > G at (pis)(pi 0) = (pis). Clearly a and a 
satisfy the same initial condition, 

a (0, ct ( s , <?)) = 5(0, (p(s)) = (pis). (5.124) 

a also satisfies the same differential equation as a : 

d d d 

—ait, (pit)) = —<p(s)<pit) = (L 0 ( s) )*—0(r) 

dr dr dr 

= {L^( s ))*X((p it)) 

= X i(p is)(p it)) (left-invariance) 

= X(5(t,(P(s))). (5.125) 

From the uniqueness theorem of ODEs, we conclude that 

(pis + t) — <pis)(p(t). (5.126) 

We have found that there is a one-to-one correspondence between a one- 
parameter subgroup of G and a left-invariant vector field. This correspondence 
becomes manifest if we define the exponential map as follows. 

Definition 5.13. Let G be a Lie group and V e T e G. The exponential map 
exp : T e G —*■ G is defined by 


exp V = (pvi 1) 


(5.127) 



where (j>\/ is a one-parameter subgroup of G generated by the left-invariant vector 
field Xy\ g = L g *V. 

Proposition 5.2. Let V e T e G and let f e EL Then 

exp(rV) = 4>v(t) (5.128) 

where <py(t ) is a one-parameter subgroup generated by Xy | g = L g *V . 


Proof. Let a ^ 0 be a constant. Then (py{at ) satisfies 


—<Pv(at) 

at 


1=0 


a—(py(t) 

at 


= aV 
t = o 


which shows that (py{at) is a one-parameter subgroup generated by L g *aV . The 
left-invariant vector field L g *aV also generates 4> a y{t) and, from the uniqueness 
of the solution, we find that <py(cit) — 4> a v(t). From definition 5.13, we have 


exp(aF) = <p aV { 1) = (py(a ). 
The proof is completed if a is replaced by t . 


□ 


For a matrix group, the exponential map is given by the exponential of a 
matrix. Take G — GL (n, R) and A e g[(n, R). Let us define a one-parameter 
subgroup 0,4 : R — >• GL(«, R) by 

t 2 t ” 

tpA (0 = exp(/A) = I n + t A + — A~ + •••-! — -A" + • • • . (5.129) 

2! n\ 

In fact, 4>A(t) e GL(n, R) since [</>a(0] -1 = <pA(—t) exists. It is also easy to see 
(pA (t)<pA G) = <f>(t + s). Now the exponential map is given by 

<t> A { 1) = exp (A) = /„ + A + i-A 2 + • • • + —A n + • • • . (5.130) 

2! n\ 

The curve g exp(7 A) is a flow through g e G. We find that 


— gexp(fA) 
at 


— L g *A — Xa | 


t = o 


where Xa is a left-invariant vector field generated by A. From (5.115), we find, 
for a matrix group G, that 


L g *A = X A \ g = gA. (5.131) 

The curve g exp(rA) defines a map a, : G -> G by a,(g) = g expit A) which is 
also expressed as a right-translation. 


— ^exp(rA)- 


(5.132) 



5.6.4 Frames and structure equation 


Let the set of n vectors {V\, V 2 , ... , V n \ he a basis of T e G where n — dim G. 
[We assume throughout this book that n is finite.] The basis defines the set of n 
linearly independent left-invariant vector fields {X\, X 2 , . . . , X„] at each point g 
in G by X tl | ^ = L g *Vp. Note that the set {X /t } is a frame of a basis defined 
throughout G. Since [Xp , X v ]\ g is again an element of 9 at g. it can be expanded 
in terms of {X /t } as 

[X ll ,X v ] = c llv k X 3i (5.133) 

where c IJ V k are called the structure constants of the Lie group G. If G is a matrix 
group, the LHS of (5.133) at g = e is precisely the commutator of matrices V jL 
and V v ; see (5.116). We show that the c |ttv A are, indeed, constants independent of 
g. Let C/n/ ie) be the structure constants at the unit element. If L gJf is applied to 
the Lie bracket, we have 


[X li ,X v ]\g = c liV x (e)X k \ g 

which shows the ^-independence of the structure constants. In a sense, the 
structure constants determine a Lie group completely (Lie’s theorem). 

Exercise 5.23. Show that the structure constants satisfy 

(a) skew-symmetry 

Cfiv — Cvfi (5.134) 

(b) Jacobi identity 

Cpv C T p + Cpp C T v T" C V p c T p — 0. (5.135) 

Let us introduce a dual basis to {X /t } and denote it by {0 M }; (0 lL , X v ) = <$y . 
{6 ll \ is a basis for the left-invariant one-forms. We will show that the dual basis 
satisfies Maurer-Cartan’s structure equation, 

dd kL = -jc^O" a 0 x . (5.136) 

This can be seen by making use of (5.70): 

d e»(x v , x k ) = x v [e^(x k )] - x x [e>*(x v )] - e^([x v . x k ]) 

= X v [S' k l - X k [S'J] - 0' l (c vk «X K ) = -cvx 11 

which proves (5.136). 

We define a Lie-algebra- valued one-form 9 : T^G -> T e G by 

9 : X h* (L g - i)*X = (L g )~ l X X e T g G. (5.137) 

9 is called the canonical one-form or Maurer-Cartan form on G. 



Theorem 5.3. (a) The canonical one-form 9 is expanded as 


e=V ll ®e li (5.138) 

where { Vj, } is the basis of T e G and {Q 11 } the dual basis of T*G. 

(b) The canonical one-form 9 satisfies 

d6+ i[0 A 9] = 0 (5.139) 

where d 9 = V ^ <g> d 9^ and 

[OA0] = [V ll ,V v i®0 fl AO v . (5.140) 


Proof. 

(a) Take any vector Y = Y 11 X /jL e T g G, where { X M ) is the set of frame 
vectors generated by { } ; X^lg — L g * V jL . From (5.137), we find 

9(Y) = y*0( X„) = y' t (L,,)- I [Lj,V #t ] = y*%. 


However, 


(V„ ® 0»)(Y) = Y v V^\X v ) = Y v V^ = y*%. 

Since Y is arbitrary, we have 9 = <g> 9^. 

(b) We use the Maurer-Cartan structure equation (5.136): 

d 9 + \[6 A 9] = 0 c vX ll 9 v A 9 X + ±c yA A % ® 9 V A 9 X = 0 

where the Cvx 11 are the structure constants of G. □ 

5.7 The action of Lie groups on manifolds 

In physics, a Lie group often appears as the set of transformations acting on a 
manifold. For example, SO(3) is the group of rotations in IR 3 , while the Poincare 
group is the set of transformations acting on the Minkowski spacetime. To study 
more general cases, we abstract the action of a Lie group G on a manifold M. 
We have already encountered this interaction between a group and geometry. In 
section 5.3 we defined a flow in a manifold M as a map a : M x M -> M, in 
which ffi acts as an additive group. We abstract this idea as follows. 

5.7.1 Definitions 

Definition 5.14. Let G be a Lie group and M be a manifold. The action of G on 
M is a differentiable map o : G x M -» M which satisfies the conditions 

(i) o(e,p)=p forany peM (5.141a) 

(ii) cr(gi, o(g 2 , p)) = o{g\g2, p). (5.141b) 



[Remark: We often use the notation gp instead of a(g, p). The second condition 
in this notation is giigip) = ig\gi)p-] 

Example 5.16. (a) A flow is an action of 1 on a manifold M. If a flow is 
periodic with a period T, it may be regarded as an action of U(l) or SO(2) 
on M. Given a periodic flow a(t, x ) with period T, we construct a new action 
&(exp(2jrit/T), x ) = er(f, x) whose group G is U(l). 

(b) Let M e GL («, IK) and let x e IK". The action of GL(«, IK) on IK" is 
defined by the usual matrix action on a vector: 


a(M , x) — M ■ x. 


(5.142) 


The action of the subgroups of GL(n, IK) is defined similarly. They may also act 
on a smaller space. For example, O (n) acts on S" _ 1 (r), an (n — l)-sphere of 
radius r, 


a : O (n) x S n ~\r) -> S' ,_ 1 (r). (5.143) 


(c) It is known that SL(2, C) acts on a four-dimensional Minkowski space 
M\ in a special manner. For x — (x , x 1 , x 2 , x 3 ) e M4, define a Hermitian 
matrix, 


X(x) = x^a^ 


A + .V 3 x l — ix 2 \ 
x 1 + ix 2 x° — x 3 ) 


(5.144) 


where a) t = (h , o \ , cr 2 , < 73 ), cr,- ( i — 1,2,3) being the Pauli matrices. Conversely, 
given a Hermitian matrix X, a unique vector (x /; ) e M 4 is defined as 


x^ = itr (cr M X) 


(5.130) 


where tr is over the 2x2 matrix indices. Thus, there is an isomorphism between 
M 4 and the set of 2 x 2 Hermitian matrices. It is interesting to note that 
detX(x) = (x 0 ) 2 — (x 1 ) 2 — (x 2 ) 2 — (x 3 ) 2 = — X'-rjX = —(Minkowski norm) 2 . 
Accordingly 

det X (x) > 0 if x is a timelike vector 

= 0 if x is on the light cone 

<0 if x is a spacelike vector. 


Take A e SL(2, Q and define an action of SL(2, C) on M 4 by 

cr(A,x) = AX(x)A\ (5.145) 


The reader should verify that this action, in fact, satisfies the axioms of definition 
5.14. The action of SL(2, Q on M 4 represents the Lorentz transformation 
0(1, 3). First we note that the action preserves the Minkowski norm, 

deta(A, x) = det[AX(x)A^] = detX(x) 



since det A = det A ' = 1. Moreover, there is a homomorphism <p : SL(2, C) — » 
0(1, 3) since 

A(BXB^)A^ = (AB)X(AB)\ 

However, this homomorphism cannot be one to one, since A e SL(2, Q and — A 
give the same element of 0(1, 3); see (5.145). We verify (exercise 5.24) that the 
following matrix is an explicit form of a rotation about the unit vector h by an 
angle 6, 


A = exp 


e „ 

-!-(/f • a) 


o „ e 

— cos — Ii — 1 (n ■ a) sin — . 


(5.146a) 


The appearance of 0 /2 ensures that the homomorphism between SL(2, C) and the 
0(3) subgroup of 0(1 , 3) is indeed two to one. In fact, rotations about h by 6 and 
by 27r + 0 should be the same 0(3) rotation, but A(2n+0) — — A(6) in SL(2, C). 
This leads to the existence of spinors. [See Misner et al (1973) and Wald (1984).] 
A boost along the direction h with the velocity v = tanh a is given by 

— (n ■ cr ) J = cosh —h + (« • a) sinh — . (5.146b) 

We show that ip maps SL(2, Q onto the proper orthochronous Lorentz group 
0+(l, 3) = [A £ 0(1, 3)| det A = +1, Aoo > 0}. Take any 

A=(« ^ ) € SL(2, Q 


and suppose = (1, 0, 0, 0) is mapped to x . If we write q>{A ) = A, we have 


, n 1 + 1 
x /0 = - tr (AXA f ) = - tr 


1 


= -(\ a \ 2 + \b\ 2 + \c\ 2 + \d\ 2 )>0 


hence Aoo > 0. To show det A = +1, we note that any element of SL(2, Q may 
be written as 


_ / e “ 0 \ / cos p sin yS e ly \ 

A “ \ 0 e _i “ JV — sin ft eT^ cos p ) B 

_ ( e”/ 2 0 \ 2 ( cos(/f/2) sin( / S/2)e i J / \ 2 

- V 0 e”/ 2 7 v — sin(yS/2)e _i>/ cos(£/2) ) 

= M 2 N 2 Bq 

where B = B 2 is a positive-definite matrix. This shows that (p(A) is positive 
definite: 

det^(A) = (det<p(M)) 2 (det^(A)) 2 (det^(Bo)) 2 > 0. 



Now we have established that <p( SL(2, C)) C o]_(l, 3). Equations (5.146a) and 
(5.146b) show that for any element of 0+(l, 3), there is a corresponding matrix 
A e SL(2, C), hence <p is onto. Thus, we have established that 

<p( SL(2, O) = 0]_(1, 3). (5.147) 

It can be shown that SL(2, C) is simply connected and is the universal covering 
group Spin( 1, 3) of oj_(l, 3), see section 4.6. 

Exercise 5.24. Verify by explicit calculations that 

(a) 



represents a rotation about the ’-axis by 0; 

(b) 

_ / cosh(a/2) + sinh(a/2) 0 \ 

y 0 cosh(a/2) — sinh(a/2) J 

represents a boost along the z-axis with the velocity v — tanh a. 

Definition 5.15. Let G be a Lie group that acts on a manifold M by a : G x M — > 
M. The action o is said to be 

(a) transitive if, for any p\ , pi e M, there exists an element g e G such 
that a (g, pi) = p 2 \ 

(b) free if every non-trivial element g e of G has no fixed points in M, 
that is, if there exists an element p e M such that cr(g, p) — p, then g must be 
the unit element e; and 

(c) effective if the unit element e e G is the unique element that defines the 
trivial action on M, i.e. if a(g, p) = p for all p e M, then g must be the unit 
element e. 

Exercise 5.25. Show that the right translation R : ( a,g ) i-> R a g and left 
translation L : («, g) L a g of a Lie group are free and transitive. 

5.7.2 Orbits and isotropy groups 

Given a point p e M, the action of G on p takes p to various points in M. The 
orbit of p under the action o is the subset of M defined by 

Gp = {a(g,p)\geG}. (5.148) 


If the action of G on M is transitive, the orbit of any p e M is M itself. Clearly 
the action of G on any orbit Gp is transitive. 



Example 5.17. (a) A flow a generated by a vector field X — —yd/dx + xd/dy is 
periodic with period 2tx, see example 5.9. The action u : 1 x t 2 -> l 2 defined 
by ( t , (x, y)) -* <x(t, (x, y)) is not effective since a (Inn, (x, y)) = ( x , y ) for all 
(x, y) e E 2 . For the same reason, this flow is not free either. The orbit through 
(x, y) ^ (0, 0) is a circle S 1 centred at the origin. 

(b) The action of 0(«) on E" is not transitive since if |x| ^ \x'\, no element 
of O(n) takes x to x' . However, the action of O(n) on S"~ l is obviously transitive. 
The orbit through x is the sphere .S'" - 1 of radius | jc | . Accordingly, given an action 
a : O in) x E" — > E", the orbits divide E" into mutually disjoint spheres of 
different radii. Introduce a relation by x ~ y if y = er(g, x ) for some g e G. It 
is easily verified that ~ is an equivalence relation. The equivalence class [x] is an 
orbit through x. The coset space E" /O in) is [0, oo) since each equivalence class 
is parametrized by the radius. 

Definition 5.16. Let G be a Lie group that acts on a manifold M. The isotropy 
group of p e M is a subgroup of G defined by 

H(p) = {geG\o(g,p) = p}. (5.149) 

Hip) is also called the little group or stabilizer of p. 

It is easy to see that H(p) is indeed a subgroup. Let gi, g 2 e Hip), then 
gig 2 e H(p) since cr(gig 2 , p) = <x(gt, <x(g 2 , p)) = a{g\,p) = p. Clearly 
e e H(p) since a(e, p) — p by definition. If g e Hip), then g~ l e Hip) since 
P = o(e, p) — o-(g _1 g, p) = o(g~ l ,cr(g, p)) = cr(g -1 , p). 

Exercise 5.26. Suppose a Lie group G acts on a manifold M freely. Show that 
H{p) — {e} for any p e M. 

Theorem 5.4. Let G be a Lie group which acts on a manifold M. Then the 
isotropy group H(p) for any p e M is a Lie subgroup. 

Proof. For fixed p e M, we define a map tp p : G -> M by tp p ig) = gp. Then 
H(p) is the inverse image tp ~ 1 ip) of a point p, and hence a closed set. The group 
properties have been shown already. It follows from theorem 5.2 that Hip) is a 
Lie subgroup. □ 

For example, let M — E 3 and G — SO(3) and take a point p — (0, 0, 1) e 
E 3 . The isotropy group Hip) is the set of rotations about the ’-axis, which is 
isomorphic to SO(2). 

Let G be a Lie group and H any subgroup of G. The coset space G/H admits 
a differentiable structure and G/H becomes a manifold, called a homogeneous 
space. Note that dim G/H = dimG — dim H. Let G be a Lie group which 
acts on a manifold M transitively and let Hip) be an isotropy group of p e M. 
Hip) is a Lie subgroup and the coset space G/Hip) is a homogeneous space. 



In fact, if G, H(p) and M satisfy certain technical requirements (for example, 
G/H(p) compact) is, it can be shown that G/H(p) is homeomorphic to M, see 
example 5.18. 

Example 5.18. (a) Let G — SO(3) be a group acting on M 3 and H — SO(2) be 
the isotropy group of x e M 3 . The group SO(3) acts on S 2 transitively and we 
have SO(3)/SO(2) = S 2 . What is the geometrical picture of this? Let g' — gh 
where g, g' e G and h e H . Since H is the set of rotations in a plane, g and 
g' must be rotations about the common axis. Then the equivalence class [g] is 
specified by the polar angles ( 0 , <p). Thus, we again find that G/H — S 2 . Since 
SO(2) is not a normal subgroup of SO(3), S 2 does not admit a group structure. 

It is easy to generalize this result to higher-dimensional rotation groups and 
we have the useful result 


SO(n + l)/SO(n) = S n . (5.150) 

O (n + 1) also acts on S n transitively and we have 

0(n + l)/0(n) = S n . (5.151) 

Similar relations hold for U(n) and SU(n): 

U (n + 1)/U(n) = SU(n + l)/SU(n) = S 2 " +1 . (5.152) 

(b) The group O (n + 1) acts on IIP" transitively from the left. Note, first, 
that 0(n + 1) acts on IR" +1 in the usual manner and preserves the equivalence 
relation employed to define HP" (see example 5.12). In fact, take x, x' e M' !+1 
and g e O (n + 1). If x ~ x' (that is if x' = ax for some a e M — {0}), then it 
follows that gx ~ gx' ( gx ' = agx). Accordingly, this action of 0(« + 1) on M' !+1 
induces the natural action of 0(n + 1) on KP" . Clearly this action is transitive 
on EP" . (Look at two representatives with the same norm.) If we take a point p 
in IRP" , which corresponds to a point (1, 0, . . . , 0) e , the isotropy group 
H(p) is 


H(p) = 


( ±1 0 
0 
0 

V o 


0(n) 


0 \ 




= 0(1) x O (n) 


(5.153) 


where 0(1) is the set { — 1, +1} = 7Lt_. Now we find that 

O (n + l)/fO(l) x 0(n)] = S"/Z 2 = IRP" . 


(5.154) 


(c) This result is easily generalized to the Grassmann manifolds: Gk.n TO = 
O(n) /[O(k) x 0(n — k)]. We first show that 0(/i) acts on Gt jn (ffi) transitively. 



Let A be an element of Gk, n TO, then A is a (-dimensional plane in W . Define an 
n x n matrix Pa which projects a vector v e K" to the plane A . Let us introduce an 
orthonormal basis {e\,...,e n } in W and another orthonormal basis [fi, ■ ■ ■ , fk] 
in the plane A, where the orthonormality is defined with respect to the Euclidean 
metric in M” . In terms of {e,}, f a is expanded as f a = fai^i and the projected 
vector is 


P A V = + h (vfk)fk 

— ^ ' (ty fli f\j Vi fki fkj ) e j — ^ ' Vi fai faj e j • 

i,j i,a,j 

Thus, Pa is represented by a matrix 

{PA)ij=Y,f«if°l- (5-155) 

Note that P^ — Pa, P\ = Pa and tr P,\ = k. [The last relation holds since it is 
always possible to choose a coordinate system such that 

P A = diagC L L L 0, ... .0). 

k n—k 

This guarantees that A is, indeed, a A;-dimensional plane.] Conversely any matrix 
P that satisfies these three conditions determines a unique ^-dimensional plane in 
K" , that is a unique element of G, t,„ (K). 

We now show that O(n) acts on Gk, n (US) transitively. Take A e Gk, n (®) an d 
g e O (n) and construct Pb = gPAg~ l ■ The matrix P/: determines an element 
B e G k.n ( TL) since Pj = P/;, P l B = Pb and tr Pb — k. Let us denote this 
action by B = <x(g , A). Clearly this action is transitive since given a standard 
('-dimensional basis of A, {/j, . . . , /*■} for example, any (-dimensional basis 
{/], . . . , fk} can be reached by an action of O(n) on this basis. 

Let us take a special plane Co which is spanned by the standard basis 
{/i, . . . , fk). Then an element of the isotropy group H (Co) is of the form 


( n—k 


M = 


81 0 
0 82 


k 

n — k 


(5.156) 


where g \ e O ((). Since M e O(n), an (n — () x (n — k) matrix g 2 must be an 
element of O (n — (). Thus, the isotropy group is isomorphic to 0(() x O (n — (). 
Finally we verified that 


G k ,„(M) = 0(n)/[0(() x O (n - ()]. (5.157) 

The dimension of Gk, n TO is obtained from the general formula as 

dimGyt^TO = dimO(n) — dim[0(() x O (« — ()] 

= ±_n(n - 1) - [i((( - 1) + \{n - k)(n - k - 1)] 

= ((/!-() (5.158) 



in agreement with the result of example 5.5. Equation (5.157) also shows that the 
Grassmann manifold is compact. 


5.7.3 Induced vector fields 

Let G be a Lie group which acts onM as(g,j) gx. A left-invariant vector 
field Xy generated by V e T e G naturally induces a vector field in M. Define a 
flow in M by 

a(t, x ) = exp(f L)jc, (5.159) 

cr (t,x) is a one-parameter group of transformations, and define a vector field 
called the induced vector field denoted by V®, 

exp (tV)x . (5.160) 

dt t = o 

Thus, we have obtained a map (t ■ T e G — > X(M) defined by V i— V' . 

Exercise 5.27. The Lie group SO(2) acts on M — it 2 in the usual way. Let 



be an element of so (2). 

(a) Show that 

. ... /cos t - sinf \ 

6Xp(fy)= ( sinr cost ) 

and find the induced flow through 

X= (y ) eR2 ' 

(b) Show that = —yd/dx +xd/d y. 

Example 5.19. Let us take G — SO (3) and M = M 3 . The basis vectors of T e G 
are generated by rotations about the x, y and z axes. We denote them by X x , X y 
and X-. respectively (see exercise 5.22), 

/ 0 0 0 \ / 0 0 1 \ / 0 -1 0 \ 

^=00-1, A } = I 0 0 0, A- = 1 0 0 . 

V o 1 \ -1 0 0 / \ooo/ 


Repeating a similar analysis to the previous one, we obtain the corresponding 
induced vectors, 




5.7.4 The adjoint representation 

A Lie group G acts on G itself in a special way. 

Definition 5.17. Take any a e G and define a homomorphism ad„ : G —> G by 
the conjugation, 

ad fl : g i-> aga~ l . (5.161) 

This homomorphism is called the adjoint representation of G. 

Exercise 5.28. Show that ad fl is a homomorphism. Define a map a : G x G — » G 
by a{a, g) = ad a g. Show that a (a, g) is an action of G on itself. 


Noting that ad fl e = e, we restrict the induced map ad fl * : T g G — > T. d( \ ag G to 

8 = e, 

Ad (l : T e G -> T e G (5.162) 

where Ad fl = ad fl *l 7 - G . If we identify T e G with the Lie algebra g, we have 
obtained a map Ad : G x g -> g called the adjoint map of G. Since 
ad fl *ad/,* = ad a b*, it follows that Ad fl Ad* = Ad a b. Similarly, Ad fl -i = Ad" 1 
follows from ad^i^ad,,*!^ = id t„g- 

If G is a matrix group, the adjoint representation becomes a simple matrix 
operation. Let g e G and Xy eg, and let o\/(t) — exp(f L ) be a one- 
parameter subgroup generated by V e T e G. Then adj, acting on a y(f) yields 
gexp(rL)g _1 = exp (tgVg -1 ). As for Ad ? we have Ad ? : V i-> gVg~ l since 


AdgV = 


d 

— [ad g exp(tV)] 




exp(rgLg x ) 


t = o 


t=0 


gVg 


-t 


(5.163) 


Problems 


5.1 The Stiefel manifold Vim, r) is the set of orthonormal vectors { e, } (1 < i < 
r ) in TR m ( r < m). We may express an element A of V (m, r) by an m x r matrix 
{e\, *. . , e r ). Show that SO (m) acts transitively on V{m, r). Let 


/ 

1 

0 

... 0 



0 

1 

... 0 



0 

0 

... 1 



0 

0 

... 0 


V 

0 

0 

... 0 

/ 


be an element of V ( m , r). Show that the isotropy group of ,4 q is SO(w — r). Verify 
that V(m, r) — SO(m)/SO(m — r) and dim V(m, r) — [r(r — l)]/2 + r(m — r). 
I Remark : The Stiefel manifold is, in a sense, a generalization of a sphere. Observe 
that V(m, 1) = 



5.2 Let M be the Minkowski four-spacetime. Define the action of a linear operator 


* : Q r (M 


(M) by 






r = 0 

*1 = 

= -dx° 

A 

dx 1 

9 

A d.Y" A d.Y 

3. 


r = 1 

*d.v 

' - -dx J 

A dr* A d.Y° 

* d.x° 

— — dx 1 A d.Y 2 A d.Y 3 

r — 2 

*d.v 

> 

CL 

* 


dx k 

A d.Y° 

* dx' A 

d.Y° = — dx-' A dx k \ 

r = 3 

*d.Y 

1 A d.Y 2 

■ 

d.Y 3 

= — d.Y° 

* dx' 

A dx ' A d.Y° = — dx k 

r = 4 

*dv 

0 A d.Y 1 

A 

d.Y 2 

A d.Y 3 = 1; 




where (i, j, k ) is an even permutation of (1,2, 3). The vector potential A and 
the electromagnetic tensor F are defined as in example 5.11. J = J^dx' 1 — 
pdx° + jk dx k is the current one-form. 

(a) Write down the equation d * F — *J and verify that it reduces to two of the 
Maxwell equations V • E = p and V x B — dE/dt = j . 

(b) Show that the identity 0 = d(d * F) = d * J reduces to the charge 
conservation equation 

9p 

9^ = £ + v -j = 0. 

ot 

(c) Show that the Lorentz condition = 0 is expressed as d * A = 0. 
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DE RHAM COHOMOLOGY GROUPS 


The homology groups of topological spaces have been defined in chapter 3. If 
a topological space M is a manifold, we may define the dual of the homology 
groups out of differential forms defined on M. The dual groups are called the 
de Rham cohomology groups. Besides physicists’ familiarity with differential 
forms, cohomology groups have several advantages over homology groups. 

We follow closely Nash and Sen (1983) and Flanders (1963). Bott and Tu 
(1982) contains more advanced topics. 

6.1 Stokes’ theorem 

One of the main tools in the study of de Rham cohomology groups is Stokes’ 
theorem with which most physicists are familiar from electromagnetism. Gauss’ 
theorem and Stokes’ theorem are treated in a unified manner here. 

6.1.1 Preliminary consideration 

Let us define an integration of an / -form over an /--simplex in a Euclidean space. 
To do this, we need first to define the standard //-simplex a r = (popi . . . p r ) in 
W where 


PO = (0, 0, . . . , 0) 

Pi = ( 1 , 0 ,..., 0 ) 


Pr = (0, 0, .... 1) 

see figure 6. 1 . If {x^} is a coordinate of W , d r is given by 
a r — (x 1 , . . . , x r ) e IR'" 

An /--form a> (the volume element) in W is written as 



( 6 . 1 ) 


ft/ = a(x) dx 1 A dx 2 A ... A dx' . 



Figure 6.1. The standard 2 -simplex 02 — (P0P1P2) and the standard 3 -simplex 
°3 = (PoPl P2 P3)' 


We define the integration of co over cr r by 


I co = I a(x) dx 1 dx 2 . . . dx r 
J (J r J (J r 


where the RHS is the usual r-fold integration. For example, if r — 2 and 
co = dx A dy, we have 


f co = [ dx dy = f dx f dy = 
J 02 J cry JO JO 


Next we define an r -chain, an r-cycle and an r-boundary in an m- 
dimensional manifold M. Let oy be an r -simplex in W and let / : a r -> M 
be a smooth map. [To avoid the subtlety associated with the differentiability of 
/ at the boundary of ay, / may be defined over an open subset U of W , which 
contains a,-.] Here we assume f is not required to have an inverse. For example, 
im / may be a point in M. We denote the image of ay in M by s r and call it a 
(singular) / -simplex in M. These simplexes are called singular since they do not 
provide a triangulation of M and, moreover, geometrical independence of points 
makes no sense in a manifold (see section 3.2). If { .sy, / } is the set of r-simplexes 
in M, we define an r -chain in M by a formal sum of {s r j\ with M-coefficients 



In the following, we are concerned with IK-coefficients only and we omit the 
explicit quotation of ffi. The r -chains in M form the chain group C, (M). Under 
/ : ay — > M, the boundary 3 a,- is also mapped to a subset of M. Clearly, 
ds r = /( 3oy) is a set of (r — l)-simplexes in M and is called the boundary of 



s r . ds r corresponds to the geometrical boundary of s r with an induced orientation 
defined in section 3.3. We have a map 

3 : C,(M) — > C,_i(M). (6.4) 

The result of section 3.3 tells us that 3 is nilpotent; 3 2 = 0. 

Cycles and boundaries are defined in exactly the same way as in section 3.3 
(note, however, that Z is replaced by M). If c r is an r -cycle, 3 cy = 0 while if c r 
is an r-boundarv. there exists an (r + l)-chain c r+ \ such that c r — 3c r+ i . The 
boundary group B r (M ) is the set of /--boundaries and the cycle group Z, (M) 
is the set of r -cycles. There are infinitely many singular simplexes which make 
up C r (M), B,-(M ) and Z r (M). It follows from 3 2 = 0 that Z, (M ) D B r (M); cf 
theorem 3.3. The singular homology group is defined by 


H r (M) = Z r (M)/B r (M). 


(6.5) 


With mild topological assumptions, the singular homology group is isomorphic to 
the corresponding simplicial homology group with IR-coefficients and we employ 
the same symbol to denote both of them. 

Now we are ready to define an integration of an / -form o> over an /'-chain in 
M. We first define an integration of co on an r- simplex s r of M by 



( 6 . 6 ) 


where / : a r -> M is a smooth map such that s r = f{p r ). Since f*a> is 
an r - form in W , the RHS is the usual /'-fold integral. For a general /'-chain 
c = a i s r,i e C r (M), we define 


/ co = J2 a i / 

Jc • Js r j 


CO. 


(6.7) 


6.1.2 Stokes’ theorem 


Theorem 6. 1. (Stokes’ theorem) Let &> e £Z r 1 (M) and c e C r (M). Then 



( 6 . 8 ) 


Proof. Since c is a linear combination of /--simplexes, it suffices to prove (6.8) for 
an r- simplex s r in M. Let / : ay -> M be a map such that f(8 y) = s r . Then 



L 


d (/*©) 


where (5.75) has been used. We also have 



f*co. 



Note that f*a> is an (/- — l)-form in IK'' . Thus, to prove Stokes’ theorem 


/ d co = I co 
J s r J ds r 


(6.9a) 


x[r 


(6.9b) 


it suffices to prove an alternative formula 

[_ df= f 

J Cff J dOr 

for an (r — l)-form x[r in W . The most general form of x[r is 

xfr — M (x) dx 1 A ... A dx^ 1 A dx /t+1 A ... A dx' . 

Since an integration is distributive, it suffices to prove (6.9b) for x[r = a(x)dx 1 A 
. . . A dx' -1 . We note that 

dl/r = -E dx r a dx 1 A ... A dx' -1 = (—l)' -1 -E dx 1 a ... A dx'— 1 A dx' . 
9x'- dx r 

By direct computation, we find, from (6.2), that 


[ dxjr = (— l) r 
J O r 


= (-D r 


1 f — dx 1 

Ja r dx' 


. . dx'- 1 dx' 


^>0-Eu=j x"<l 


dx 1 . . . dx 


f l ^/i=i x '‘ da 
Jo dx'' C 

r— 1 n . 

c 1 x' _1 , 1 — ^2 x j ~ a (E • • • » x r ~ l , 0^ 


n = \ 


For the boundary of o r , we have 


do,- = (p 1, P2 Pr) - (po, P2, Pr) 

H h (-l) r (B0, pi Pr- 1). 

Note that i jr — a(x)dx 1 A ... A dx' - 1 vanishes when one of x , . . . , x r ~ 1 is 
constant. Then it follows that 


L 


x[r — 0 


(p 0 ,p 2 ,-.pr) 

since x 1 = 0 on (po, p 2 , . . . , p r ). In fact, most of the faces of do,- do not 
contribute to the RHS of (6.9b) and we are left with 


/ f = f 

J 3 O r J (l 


^ + (-i y 


(. Pl,P2,--,Pr ) 


L 


xjf. 


(PO,PX Pr- 1) 



Since (po. pi, ■ ■ ■ , p,— 1 ) is the standard (r — l)-simplex (x 11 > 0, X>1 — 

1), on which x r — 0, the second term is 


(— 1 V I ijr = (— l) r / a(x l ,...,x r x , 0) dx 1 . . . dx r x . 

J(P0,Pl,-,Pr-l) Jar-i 

The hrst term is 



1 a j x , . . 

,.,x r -\ 1 - y x" Idx 1 ...dr'- 1 

hpi,P2,....p r ) 

J(P 1 Pr-l,P0) V 

1 7 


= (-I)''” 1 f a(x\...,x r -\ 1 - ^x /i ')djc 1 ...dj r_1 
J°r~ 1 ' M =l ' 


where the integral domain (pi, . . . , p r ) has been projected along x r to the 

(pi p r - 1 , poj-plane. preserving the orientation. Collecting these results, we 

have proved (6.9b). [The reader is advised to verify this proof for m — 3 using 
figure 6.1.] □ 

Exercise 6.1. Let M = M 3 and co — a d x+b d v+c dz. Show that Stokes’ theorem 
is written as 


J curl co ■ dS = (j) 


co ■ dS 


(Stokes’ theorem) 


( 6 . 10 ) 


where co — (a,b,c) and C is the boundary of a surface S. Similarly, for 
i jr = j V / /iu dx^ A d.r y , show that 


L 


div ifr dL 



(Gauss’ theorem) 


where i jr x = e' 4> ' v \jj and S is the boundary of a volume V . 


6.2 de Rham cohomology groups 
6.2.1 Definitions 

Definition 6.1. Let M be an m -dimensional differentiable manifold. The set of 
closed / -forms is called the rth cocycle group, denoted Z' (M). The set of exact 
/--forms is called the rth coboundary group, denoted B' (M). These are vector 
spaces with M-coefficients. It follows from d 2 = 0 that Z' (M) D B r (M). 

Exercise 6.2. Show that 

(a) if co e Z' (M) and i jr e Z S (M), then co A e Z r+S (M ); 

(b) if co e Z r (M) and e B S (M), then co A e B r+S (M ); and 


(c) if co e B r (M) and \ jr e B S (M ), then wAf e B r+S (M). 


Definition 6.2. The rth de Rham cohomology group is defined by 

H r (M; E) = Z'\M)/B r (M). (6.11) 

If r < — 1 or r > m + 1, H' iM: E) may be defined to be trivial. In the following, 
we omit the explicit quotation of E-coefficients. 

Let co e Z' (M). Then [co] e H' (M ) is the equivalence class {a/ e 
Z r (M)\co r — co + dfr, fr e £T' _1 (M)}. Two forms which differ by an exact 
form are called cohomologous. We will see later that H 1 [M ) is isomorphic to 
H r (M). The following examples will clarify the idea of de Rham cohomology 
groups. 

Example 6.1. When r = 0, B°(M) has no meaning since there is no (— l)-form. 
We define £2 -1 (M) to be empty, hence B°(M ) = 0. Then H°(M) — Z°(M ) = 
{/ e Q.°(M) — fF(M)|d/ = 0}. If M is connected, the condition df — 0 is 
satisfied if and only if / is constant over M. Hence, H°(M) is isomorphic to the 
vector space E, 

H°(M) = E. (6.12) 

If M has n connected components, df — 0 is satisfied if and only if / is constant 
on each connected component, hence it is specified by n real numbers, 

H°(M) = E © E © • • • © E . (6.13) 

' 

n 


Example 6.2. Let M — E. From example 6.1, we have H°(W) — E. Let us 
find H l (W) next. Let x be a coordinate of E. Since dimE = 1, any one-form 
co e £2* (E) is closed, d co = 0. Let co — fdx, where / e T(E). Define a function 
F(x) by 

F{x ) = I f(s ) ds e F(W) = S2°(E). 

Jo 

Since d F (.r ) / d.v = fix), co is an exact form, 

dT(x) 

co — f dx — dx = d F. 

dx 

Thus, any one-form is closed as well as exact. We have established 

H\ E)={0}. (6.14) 

Example 6.3. Let = {e 10 |O < 9 < 2n}. Since S l is connected, we have 
H°iS l ) = E. We compute H l (S l ) next. Let co = fid) d0 e Is it 



possible to write co = tl F for some F e If ( S 1 ) ? Let us repeat the analysis of the 
previous example. If co — (IF, then F e ,T ( ,S' 1 ) must be given by 


F(0)= f f (O') dO'. 

Jo 

For F to be defined uniquely on S 1 , F must satisfy the periodicity F(2jt) — 
F( 0) (=0). Namely F must satisfy 

p2 n 

F(2n) = / f(O') dd' = 0. 

Jo 

If we define a map X : f2 1 (5 I ) — »• M by 


X : co = / dO 



f(o')de’ 


then B [ (S l ) is identified with ker/.. Now we have (theorem 3.1) 


H l (S l ) = ^(SVkerA. = imk = EL 


(6.15) 


(6.16) 


This is also obtained from the following consideration. Let co and o> be closed 
forms that are not exact. Although co — aJ is not exact in general, we can show 
that there exists a number oel such that co’ — aco is exact. In fact, if we put 


a = 





we have 



(co' — aco) = 0. 


This shows that, given a closed form co which is not exact, any closed form o> is 
cohomologous to aco for some a e EL Thus, each cohomology class is specified 
by a real number a , hence // 1 (.S' 1 ) = EL 


Exercise 6.3. Let M — Mr — {0}. Define a one-form co by 


x l + y- 


■ dx 


x- + y- 


dy. 


(6.17) 


(a) Show that co is closed. 

(b) Define a ‘function’ F(x, y) — tan ~ l (y/x). Show that co = d F. Is co 
exact? 



6.2.2 Duality of H r (M ) and H r (M ); de Rham’s theorem 


As the name itself suggests, the cohomology group is a dual space of the 
homology group. The duality is provided by Stokes’ theorem. We first define 
the inner product of an /'-form and an /--chain in M. Let M be an ///-dimensional 
manifold and let C r (M ) be the chain group of M. Take c e C r (M ) and co e 
where 1 < r < in. Define an inner product ( , ) : C,(M) x Q r (M) — »• 

Mby 


C, C0 (c, CO) 



(6.18) 


Clearly, (c, co) is linear in both c and co and ( , co) may be regarded as a linear 

map acting on c and vice versa. 


(c\ + c 2 , co) = 


(c, 0)1 + a>2) = / (o>l + 0)2) — J COl + J CO 2 - 
Now Stokes’ theorem takes a compact form: 


( »=/'»+/' 

J C\+C2 J C\ Jc ' 

k 


CO 


(6.19a) 

(6.19b) 


(c, do>) = (3c, o>). 


( 6 . 20 ) 


In this sense, the exterior derivative operator d is the adjoint of the boundary 
operator 3 and vice versa. 

Exercise 6 . 4 . Let (i) c e B r (M), co e Z r (M) or (ii) c e Z r {M), co e B r (M). 
Show, in both cases, that (c, o>) = 0. 


The inner product ( , ) naturally induces an inner product k between 

the elements of H, (M ) and H r (M). We now show that H, (M ) is the dual 
of H r (M). Let [c] e H, (M) and [o>] e H r (M ) and define an inner product 
A : H r (M) x H r (M) -> M by 


A([c], [tw]) = (c, co) = 



( 6 . 21 ) 


This is well defined since (6.21) is independent of the choice of the 
representatives. In fact, if we take c + 3 c\ c’ e C r +\ ( M ), we have, from Stokes’ 
theorem, 

(c + 3 d , co) = (c, co) + (d , dft/) = (c, co) 
where d co — 0 has been used. Similarly, for co + di jr, i// e Q. r ~ { (M), 


(c, co + d i/r) — (c, co) + (3c, \jr) = (c, co) 

since 3c = 0. Note that A( , [a/]) is a linear map H r (M) — > 1 R, and A([c], ) is 

a linear map H' (M) -> IK. To prove the duality of H r (M) and H' {M), we have 



to show that A( , [&>]) has the maximal rank, that is, dim H r {M ) = dim H'(M). 
We accept the following theorem due to de Rham without the proof which is 
highly non-trivial. 

Theorem 6.2. (de Rham’s theorem) If M is a compact manifold, H r (M ) and 
H r (M) are finite dimensional. Moreover the map 

A : H r (M) x H r {M) ® 


is bilinear and non-degenerate. Thus, H r (M ) is the dual vector space of H r (M). 

A period of a closed r-form co over a cycle c is defined by (c, co) — f co. 
Exercise 6.4 shows that the period vanishes if co is exact or if c is a boundary. The 
following corollary is easily derived from de Rham’s theorem. 


Corollary 6.1. Let M be a compact manifold and let k be the rth Betti number 
(see section 3.4). Let ci, C 2 , . . . , Ck be properly chosen elements of Z r (M ) such 
that [q] ^[cj]. 

(a) A closed r-form x/r is exact if and only if 


L 


xfr — 0 (1 < i < k). 


( 6 . 22 ) 


(b) For any set of real numbers h \ , £> 2 , • • • , bk there exists a closed r-form co 
such that 



(1 < i < k). 


(6.23) 


Proof, (a) de Rham’s theorem states that the bilinear form A([c], [&>]) is non- 
degenerate. Hence, if A([c ; ], ) is regarded as a linear map acting on H r (M), 

the kernel consists of the trivial element, the cohomology class of exact forms. 
Accordingly, 1 jr is an exact form. 

(b) de Rham’s theorem ensures that corresponding to the homology basis 
{[ci]}, we may choose the dual basis {[ft>;]} of H r (M) such that 


A([c;], [coj]) 



If we define co = 


bjCOi, the closed r-form co satisfies 


(6.24) 


L 


a> — bi 


as claimed. 


□ 


For example, we observe the duality of the following groups. 



(a) H°(M) = Hq(M ) = ffi © • • • © E if M has n connected components. 

n 

(b) = HfS 1 ) = ffi. 

Since H' (M) is isomorphic to H r (M), we find that 

b r (M ) = dim FI' (M) = dim H r (M) = b r (M) (6.25) 


where b, (M) is the Betti number of M. The Euler characteristic is now written as 


m 

X(M) = JV imM). (6.26) 

r= 1 


This is quite an interesting formula; the LHS is purely topological while the RHS 
is given by an analytic condition (note that dm — 0 is a set of partial differential 
equations). We will frequently encounter this interplay between topology and 
analysis. 

In summary, we have the chain complex C(M) and the de Rham complex 


Cr-l(M) 4 - Cr(M ) C r+ 1(M ) 

Q r - ] (M) Q r (M) W+\M) 


(6.27) 


for which the rth homology group is defined by 


H r (M) = Z r (M)/B r (M) = ker3,./im3 ;+ i 
and the rth de Rham cohomology group is defined by 

H r {M) = Z r (M)/B'\M) = kerd,. + i/imd,. 


6.3 Poincare’s lemma 

An exact form is always closed but the converse is not necessarily true. However, 
the following theorem provides the situation in which the converse is also true. 

Theorem 6.3. (Poincare’s lemma) If a coordinate neighbourhood U of a 
manifold M is contractible to a point po e M , any closed r-form on U is also 
exact. 

Proof. We assume U is smoothly contractible to po. that is, there exists a smooth 
map F : U x / U such that 


F(x. 0) = x, 


F(x, 1) = po 


for.v e U. 



Let us consider an r-form rj e £2 r (U x I), 


i] = a ;i ..., r (x, t ) dx' 1 A ... A dx' r 

+ bj l ...j r _ l (x, t) dr A dx 71 A ... A dx 7r_1 (6.28) 

where x is the coordinate of U and t of I. Define a map P : Q' (U x /) — > 
S2 r - 1 (f/) by 

Pr) = d.s- (x, s)J dx 71 A ... A dx 7r_1 . (6.29) 

Next, define a map f t :U—>UxI by ft(x) = (x, f). The pullback of the first 
term of (6.28) by f* is an element of Q r (U), 

f*n = a h . Jr (x, t) dx' 1 A ... A dx' r e Q. r (U). (6.30) 

We now prove the following identity, 

d(Pr?) + P(dr,) = /i*r? - f 0 *r,. (6.31) 


Each term of the LHS is calculated to be 


-n: 

= f d.s- 

Jo 


( PU\ ...i r 
3x , '+ 1 

...i r 

3 r 


dx 71 A ... A dx 7 '- -1 


dx 7r A dx 71 A ... A dx 7r_1 


dx ' r+1 A dx ' 1 A ... A dx‘ r 


dr A dx ' 1 A ... A dx l ‘ 


dx Jr A dr A dx 71 A ... A dx 7r_1 


[jC* 

-U 


d.s / — d "' 1 ’ j dx ' 1 A . . . A dx' r 


dx 7r A dx 71 A . . . A dx 7r-1 . 


Collecting these results, we have 


d (Pi]) + P(d)j) = \ I dj 


dx ' 1 A ... A dx' r 


= [ai v ..i r (x, 1) - a ;i ..., r (x, 0)] dx ' 1 A ... A dx'- 

= h*-n - fo*f]- 



Poincare’s lemma readily follows from (6.31). Let a be a closed / -form on a 
contractible chart U. We will show that co is written as an exact form, 

co = d(-PF*co), (6.32) 

F being the smooth contraction map. In fact, if // in (6.31) is replaced by 
F*a> e £}' (U x /) we have 

d PF*co + P d F*co = f x * o F*co - fo* o F*co 

= (Fof x )*co-(Fof 0 )*co (6.33) 

where use has been made of the relation (fog)* — g* o f* . Clearly Fo/j : U -> 
U is a constant map x i->- po, hence ( F o f\)* — 0. However, F o fo — idy, 
hence (F o fo)* : £2 r (U) — > £2' (U) is the identity map. Thus, the RHS of 
(6.33) is simply — co. The second term of the LHS vanishes since a> is closed; 
dF*cu = F* da) = 0, where use has been made of (5.75). Finally, (6.33) becomes 
co = — d P F*co, which proves the theorem. □ 

Any closed form is exact at least locally. The de Rham cohomology group is 
regarded as an obstruction to the global exactness of closed forms. 

Example 6.4. Since K" is contractible, we have 

H’\ ffi") = 0 1 <r<n. (6.34) 

Note, however, that H°(W) = ffi. 

6.4 Structure of de Rham cohomology groups 

de Rham cohomology groups exhibit quite an interesting structure that is very 
difficult or even impossible to appreciate with homology groups. 

6.4.1 Poincare duality 

Let M be a compact ///-dimensional manifold and let co e H r (M) and i] e 
H m ~ r (M). Noting that o> A // is a volume element, we define an inner product 
(,): H'(M) x H m ~ r (M) -> ffi by 

(co,r])= I ©A/). (6.35) 

Jm 

The inner product is bilinear. Moreover, it is non-singular, that is, if co ^ 0 
or /; ^4 0, (co, /;} cannot vanish identically. Thus, (6.35) defines the duality of 
H'\M) and H m ~ r (M), 


H'(M) = H m ~'(M) 


(6.36) 



called the Poincare duality. Accordingly, the Betti numbers have a symmetry 


b r = b m - T . (6.37) 

It follows from (6.37) that the Euler characteristic of an odd-dimensional space 
vanishes, 

X(M) = £(-D r *r = ij £(-l ) r br + £(-l> B - r *m-r j 

= T 'D-Wr - = °- (6-38) 

6.4.2 Cohomology rings 

Let [&>] e H q (M) and [ r] ] e H'\M). Define a product of [a>] and [rj] by 

MaM^^ai,]. (6.39) 

It follows from exercise 6.2 that a> A is closed, hence [&> A rj] is an element of 
H q+r (M). Moreover, [co A i]\ is independent of the choice of the representatives 
of [oo] and [?;]. For example, if we take co' = co + d x/r instead of co , we have 

[co r ] A [rj] = [(cu + d \]f) A rj] — [co A ;; + d(i fr A r/)] — [a> A rj]. 

Thus, the product A : H q (M) x H r (M ) -> H q+r {M) is a well-defined map. 
The cohomology ring H*(M) is defined by the direct sum, 

m 

ff*(M) = 0/l r (M). (6.40) 

r— I 

The product is provided by the exterior product defined earlier, 

A : H*(M) x H*(M) -a //*(M). (6.41) 

The addition is the formal sum of two elements of One of the 

superiorities of cohomology groups over homology groups resides here. Products 
of chains are not well defined and homology groups cannot have a ring structure. 

6.4.3 The Kiinneth formula 

Let M be a product of two manifolds M — M i x M 2 . Let { oj-’ } (1 < i < 

b p (Mi )) be a basis of H P (M \ ) and {ij 1 ’} (1 < i < b p (M2 )) be that of H p (Mi). 
Clearly 0 /’ A r[- p (1 < p < r) is a closed r-form in M. We show that it is not 
exact. If it were exact, it would be written as 

w f A 1 {r p = d (a p ~ l A p r ~ p + Y p A S r ~ p ~ l ) 


(6.42) 



for some a ', p ~' e Q p~ 1 (M { ), p r ~P e Q. r ~P(M 2 ), y p e QP(Mi) and 8 r ~P~ l e 
Q' -P ~ 1 ( /V/ 2 ) - [If p — 0, we put a p ~ l = 0.] By executing the exterior derivative 
in (6.42), we have 

cof A tfr p = da"” 1 A p r ~P + (-1 )P- l a p ~ l A d 0 r ~ p 

+ dy p A 8 r ~ p -' +{-\)Py p A d8 r - p ~ l . (6.43) 

By comparing the LHS with the RHS, we find a p ~ l = 8'~ p ~ l = 0, hence 
w" A rfj p = 0 in contradiction to our assumption. Thus, u> p A r\ ■ p is a non- 
trivial element of H r (M). Conversely, any element of H' (M) can be decomposed 
into a sum of a product of the elements of H P (M\) and H r ~ p {M 2 ) for 0 < P < r. 
Now we have obtained the Kiinneth formula 

H r (M) — 0 [//"(Mi)® // 9 (M 2 )]. (6.44) 

p+q=r 

This is rewritten in terms of the Betti numbers as 

b r (M) = b p {M\)b q (M 2 ). (6.45) 

p+q=r 


The Kiinneth formula also gives a relation between the cohomology rings of the 
respective manifolds, 

m m 

H*(M) = Yj H r ( M ) = E © //"(Mi) ® H q (M 2 ) 

r= 1 r= 1 P+q=r 

= Y HP ( M 1 ) ® E ff9 ( M 2) = H*(Mi) ® H*(M 2 ). (6.46) 
p q 

Exercise 6.5. Let M — M 1 x M 2 . Show that 

X(M) — x(Mi) • x(M 2 ). (6.47) 

Example 6.5. Let T 2 — 5 1 x .S' 1 be the torus. Since //‘V.S 1 ) = M and // 1 (.S 1 ) = 
K, we have 


H°(T 2 ) = M ® M = M (6.48a) 

H l (T 2 ) = (K®K) ®(M®M) =M©K (6.48b) 

H 2 (T 2 ) = K®E = E. (6.48c) 

Observe the Poincare duality H°(T 2 ) — H 2 (T 2 ). [ Remark : M ® M is the tensor 
product and should not be confused with the direct product. Clearly the product 
of two real numbers is a real number.] Let us parametrize the coordinate of T 2 



as (0i, 66) where 6/ is the coordinate of S 1 . The groups H r (T 2 ) are generated by 
the following forms: 


r = 0 : 

WQ 

— Co CO € M 



r = 1 : 

W\ 

= Cl ddi + c'i d 02 

ci , c'j e K 

(6.49a) 

r = 2: 

W2 

= C2 d0\ A d()i 

C2 e IR. 



Although the one-form dOj looks like an exact form, there is no function 
is defined uniquely on S l . Since /(S 1 ) = 0, we have x(T 2 ) — 0. 

The de Rham cohomology groups of 

T n = S l x • • • x S 1 

^ v ^ 

n 

are obtained similarly. H r (T n ) is generated by /'-forms of the form 

dtT 1 A d0 h A ... A d e ir 

where i\ < F < ■ ■ ■ < i r are chosen from 1 Clearly 
b r = dim H r (T n ) = 

The Euler characteristic is directly obtained from (6.51) as 

x(T n ) = ^C-I) r (") =(i- 1)" = 0. 

6.4.4 Pullback of de Rham cohomology groups 

Let / : M -> N be a smooth map. Equation (5.75) shows that the pullback /* 
maps closed forms to closed forms and exact forms to exact forms. Accordingly, 
we may define a pullback of the cohomology groups /* : H r (N) — » H' (M) by 

/>] = [f*co] M e H r (N). (6.53) 

The pullback /* preserves the ring structure of H*(N). In fact, if [o>] e H P (N ) 
and [?;] e H q (N), we find 

f*([co] A [l,]) = f*[cO A IJ] = [f*(cO A /,)] 

= [f*co A f*tj] = [ f*w ] A [f*nl (6.54) 

6.4.5 Homotopy and H l (M) 

Let /, g : M — > A be smooth maps. We assume f and g are homotopic to each 
other, that is, there exists a smooth map F : M x / -> N such that F(p, 0) = 


Qi which 


(6.50) 


(6.51) 


(6.52) 



f(p) and F(p, 1) = g(p)- We now prove that /* : H' (N ) — »■ H r (M ) is equal 
to g* : H'\N) -> H r (M). 

Lemma 6.1. Let f* and g* be defined as before. If o> e Q r (N) is a closed form, 
the difference of the pullback images is exact, 

f*co- g*co = df (6.55) 

Proof. We first note that 

/ = F o /o, g = F o /i 

where f, : M -> M x / (p h> ( p, f)) has been defined in theorem 6.3. The 
LHS of (6.55) is 

(F o f 0 )*w - (F o fffto = f* o F*a> - f* o F*m 

= - [dP(F*oj) + P d(F*a>)] = -d P F*w 

where (6.33) has been used. This shows that f*co — g*a> = d (—PF*co). □ 

Now it is easy to see that /* = g* as the pullback maps H' (N) — »■ H' (M). 
In fact, from the previous lemma, 

[f*0> - g*( 0 ] = [f*co\ - [g*a>] = [df] = 0. 


We have established the following theorem. 

Theorem 6.4. Let /. g : M — > /V he maps which are homotopic to each 
other. Then the pullback maps /* and g* of the de Rham cohomology groups 
H' (N ) — »■ H r (M) are identical. 

Let M be a simply connected manifold, namely jti(M) = {0}. Since 
// 1 ( M ) = Jti(M) modulo the commutator subgroup (theorem 4.9), it follows 
that H\ (M) is also trivial. In terms of the de Rham cohomology group this can be 
expressed as follows. 

Theorem 6.5. Let M be a simply connected manifold. Then its first de Rham 
cohomology group is trivial. 

Proof. Let o> be a closed one-form on M. It is clear that if a> — d /, then a function 
/ must be of the form 

fip)= [ P co (6.56) 

Jpo 

po e M being a fixed point. 

We first prove that an integral of a closed form along a loop vanishes. Let 
a : I —¥■ M be a loop at p e M and let c p : / -> M (t !->• p) be a constant 



loop. Since M is simply connected, there exists a homotopy F(s. t) such that 
F(s, 0) = a(s) and F (s, 1) = c p (s). We assume F : I x I — > M is smooth. 
Define the integral of a one-form co over tf (7 ) by 



(6.57) 


where we have taken the integral domain in the RHS to be S 1 since I — [0, 1] in 
the LHS is compactified to S 1 . From lemma 6.1, we have, for a closed one-form 
co, 

oi*co — c*co = dg (6.58) 

where g — —PF*co. The pullback c p w vanishes since c p is a constant map. Then 
(6.57) vanishes since 35 1 is empty, 

f a*co = f dg = f g = 0. (6.59) 

Js l Js 1 JdS 1 

Let P and y be two paths connecting po and p. According to (6.59), integrals 
of co along p and along y are identical, 




co. 


This shows that (6.56) is indeed well defined, hence co is exact. □ 


Example 6.6. The n-sphere S n (n > 2) is simply connected, hence 

H l (S n ) = 0 n > 2. (6.60) 

From the Poincare duality, we find 

H°(S") = H"(S n ) = EL (6.61) 

It can be shown that 

H r (S n ) = 0 l < r < n — l. (6.62) 


H n (S n ) is generated by the volume element Q. Since there are no (n + 1 )-forms 
on S' 1 , every n-form is closed. Q cannot be exact since if £2 = d \ji , we would 
have 




xp — 0. 


The Euler characteristic is 


X(S")= 1 + (-!)' 


0 n is odd, 
2 n is even. 


(6.63) 



Example 6. 7. Take S 2 embedded in M 3 and define 

£2 = sin 9 dd A dcp (6.64) 

where (6,(p) is the usual polar coordinate. Verify that £2 is closed. We may 
formally write £2 as 

£2 = — d(cosd) Ad (f> = — d(cos 6 dtp). 

Note, however, that £2 is not exact. 
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RIEMANNIAN GEOMETRY 


A manifold is a topological space which locally looks like IK" . Calculus on a 
manifold is assured by the existence of smooth coordinate systems. A manifold 
may carry a further structure if it is endowed with a metric tensor, which is 
a natural generalization of the inner product between two vectors in IK" to an 
arbitrary manifold. With this new structure, we define an inner product between 
two vectors in a tangent space T p M. We may also compare a vector at a point 
p e M with another vector at a different point p' e M with the help of the 
‘connection’ . 

There are many books about Riemannian geometry. Those which are 
accessible to physicists are Choquet-Bruhat et al (1982), Dodson and Poston 
(1977) and Hicks (1965). Lightman et al (1975) and chapter 3 of Wald (1984) 
are also recommended. 

7.1 Riemannian manifolds and pseudo-Riemannian manifolds 

7.1.1 Metric tensors 

In elementary geometry, the inner product between two vectors V and V is 
defined by U ■ V = Y^,T=i Vi where U t and V, ] are the components of the 
vectors in IK'” . On a manifold, an inner product is defined at each tangent space 
T P M. 

Definition 7.1. Let M be a differentiable manifold. A Riemannian metric g on 
M is a type (0, 2) tensor field on M which satisfies the following axioms at each 
point p e M: 

(i) g p (U, V) = g p (V, U ), 

(ii) gp(U, U) > 0, where the equality holds only when U — 0. 

Here [/, V e T p M and g p — g\ p . In short, g p is a symmetric positive-definite 
bilinear form. 

A tensor field g of type (0, 2) is a pseudo-Riemannian metric if it satisfies 
(i) and 

(ii') if g p (U, V) = 0 for any U e T p M, then V = 0. 



In chapter 5, we have defined the inner product between a vector V e Tm 
and a dual vector u> e T p *M as a map ( , } : T*M x T p M — »■ K. If there 
exists a metric g, we define an inner product between two vectors U,Ve T p M 
by g p (U , V). Since g p is a map T p M <g> T p M —*■ IE we may define a linear 
map g p (U , ) : T p M — >■ ffi by V i->- g p (U, V). Then g p (U, ) is identified 
with a one-form cop e T*M. Similarly, a> e T*M induces V,„ e T p M by 
(®, U) = g(Va), U). Thus, the metric g p gives rise to an isomorphism between 
T p M and T*M. 

Let ( U , <p) be a chart in M and { x the coordinates. Since g e T^{M), it is 
expanded in terms of dx ,L ® dx 1 ' as 

gp = gpv(p ) ® dx y . (7.1a) 


It is easily checked that 


gMp) = g P (J^,-£^)=gv»(p) (peM). (7.1b) 

We usually omit p in g pv unless it may cause confusion. It is common to 
regard (gp, v ) as a matrix whose (ji, v)th entry is g flv . Since {gp, v ) has the 
maximal rank, it has an inverse denoted by (g^ v ) according to the tradition: 
gpvg vX = g Xv gvp, = The determinant det(g^„) is denoted by g. Clearly 
det(g /iv ) = g~ l . The isomorphism between T p M and T*M is now expressed as 

co„ = g pv U\ U 11 = g^cop. (7.2) 


From (7.1a) and (7.1b) we recover the ‘old-fashioned’ definition of the 
metric as an infinitesimal distance squared. Take an infinitesimal displacement 
dx^d/dx 11 e T p M and plug it into g to find 


ds 2 = g I d.r M , dx v 

= g/iv dx u . 


dx' 


— d.r M dx 1 ’ g 


dx 11 dx v 


(7.3) 


We also call the quantity d.v 2 = g pv dx /J dx 1 ' a metric, although in a strict sense 
the metric is a tensor g — g pv dx M ® dx 1 '. 

Since (g ,lv ) is a symmetric matrix, the eigenvalues are real. If g 
is Riemannian, all the eigenvalues are strictly positive and if g is pseudo- 
Riemannian, some of them may be negative. If there are i positive and j negative 
eigenvalues, the pair ( i , j) is called the index of the metric. If j = 1, the metric 
is called a Lorentz metric. Once a metric is diagonalized by an appropriate 
orthogonal matrix, it is easy to reduce all the diagonal elements to ± 1 by a suitable 
scaling of the basis vectors with positive numbers. If we start with a Riemannian 
metric we end up with the Euclidean metric S = diag(l, . . . , 1) and if we start 
with a Lorentz metric, the Minkowski metric rj = diag(— 1, 1, . . . , 1). 


If (M , g) is Lorentzian, the elements of T p M are divided into three classes 
as follows, 


(i) 

(ii) 

(iii) 


g(U, U) > 0 
g(U, U) = 0 
g(U, U) < 0 


U is spacelike. 

U is lightlike (or null), 
U is timelike. 


Exercise 7.1. Diagonalize the metric 


( 0 

1 

0 

0 ^ 

1 

0 

0 

0 

0 

0 

1 

0 

l 0 

0 

0 

1 ) 


(7.4) 


to show that it reduces to the Minkowski metric. The frame on which the 
metric takes this form is known as the light cone frame. Let jeo, e \ , ei. ej } 
be the basis of the Minkowski frame in which the metric is g llv = q llv . Show 
that {e+, e_, C 2 , e^} are the basis vectors in the light cone frame, where e± = 
(ei ± eo )/'\/2. Let V — (L + , V~, V 2 , V 3 ) be components of a vector V. Find 
the components of the corresponding one-form. 

If a smooth manifold M admits a Riemannian metric g, the pair (M, g) is 
called a Riemannian manifold. If g is a pseudo-Riemannian metric, (M, g) 
is called a pseudo-Riemannian manifold. If g is Lorentzian, (M, g) is called 
a Lorentz manifold. Lorentz manifolds are of special interest in the theory 
of relativity. For example, an m -dimensional Euclidean space (TR m , S) is a 
Riemannian manifold and an m -dimensional Minkowski space (!R m , q) is a 
Lorentz manifold. 

7.1.2 Induced metric 

Let M be an m -dimensional submanifold of an n -dimensional Riemanian 
manifold N with the metric g.v. If / : M /V is the embedding which induces 
the submanifold structure of M (see section 5.2), the pullback map /* induces 
the natural metric gM — f*gN on M. The components of gM are given by 

df a dfP 

gM^vix) = gNapifix))-^— (7.5) 

where /“ denote the coordinates of fix). For example, consider the metric of the 
unit sphere embedded in (K 3 , <$). Let (9, <p) be the polar coordinates of S 2 and 
define / by the usual inclusion 

/ : (9, 4>) i-^ (sind cos <f>, sinO sin <p, cos 9) 



from which we obtain the induced metric 

df a dfP 

g nv dx' x <g> dx = 8 a pf - - — dx M <g> dx 
9x^ 9x y 

= df) <g> dd + sin 2 0 d(p ® dcp. (7.6) 

Exercise 7.2. Let / : T 2 — > M 3 be an embedding of the toms into (M 3 , 5) defined 
by 

/ : (9. <p) i-> (( R + r cos0) cos (p. {R + r cosd ) sirup, r sinO) 
where R > r. Show that the induced metric on T 2 is 

g = r 2 Ad <g> AG + (R + r cos0) 2 d cp ® d (p. (7.7) 

When a manifold N is pseudo-Riemannian, its submanifold / : M — > N 
need not have a metric f*gN- The tensor f * gN is a metric only when it has a 
fixed index on M. 

7.2 Parallel transport, connection and covariant derivative 

A vector A is a directional derivative acting on / e S' (M) as X : f i-> X[f\. 
However, there is no directional derivative acting on a tensor field of type (p, q), 
which arises naturally from the differentiable structure of M. [Note that the Lie 
derivative CyX = [V, X] is not a directional derivative since it depends on the 
derivative of V .] What we need is an extra structure called the connection, which 
specifies how tensors are transported along a curve. 

7.2.1 Heuristic introduction 

We first give a heuristic approach to parallel transport and covariant derivatives. 
As we have noted several times, two vectors defined at different points cannot be 
compared naively with each other. Let us see how the derivative of a vector field 
in a Euclidean space TR m is defined. The derivative of a vector field V — 
with respect to x v has the /zth component 

a v ^(. . . , x v + Ax y , . . .) - . . , x y , . . .) 

= lim . 

9x y A.v-s-0 Ax v 

The first term in the numerator of the LHS is defined at x + Ax = (x, . . . , x y + 
Ax y , . . . , x m ), while the second term is defined at x = (x^). To subtract V^ix) 
from y^(x + Ax), we have to transport V ll (x) to x + Ax without change and 
compute the difference. This transport of a vector is called a parallel transport. 
We have implicitly assumed that V \ x parallel transported to x + Ax has the same 
component V^fx). However, there is no natural way to parallel transport a vector 
in a manifold and we have to specify how it is parallel transported from one point 



to the other. Let V\ x +Ax denote a vector V\ x parallel transported to x + Ax. We 
demand that the components satisfy 


V^x + Ax) - y^(x) a Ax (7.8a) 

{V^+W^ix + Ax) = V A (x + Ax) + W^ix + Ax). (7.8b) 
These conditions are satisfied if we take 

V IJ '(x + Ax) = V' l (x) - V l (x)r ll vl (x)Ax v . (7.9) 


The covariant derivative of V with respect to x v is defined by 

V^(x + Ax) - V»(x + Ax) 9 (dV 11 , „ \ 9 

lim = h V r^ v x . 

Ax v ^o Ax y dx> L \ 9x y J 9x ^ 


(7.10) 


This quantity is a vector at x + Ax since it is a difference of two vectors VU+a.y 
and V\ x +a.x defined at the same point x + Ax. There are many distinct rules 
of parallel transport possible, one for each choice of T. If the manifold is 
endowed with a metric, there exists a preferred choice of T, called the Levi-Civita 
connection, see example 7.1 and section 7.4. 


Example 7.1. Let us work out a simple example: two-dimensional Euclidean 
space (M 2 ,5). We define parallel transportation according to the usual sense 
in elementary geometry. In the Cartesian coordinate system (x, y), all the 
components of T vanish since V ,l (x + Ax, y + Ay) = V M (x, y) for any Ax 
and Ay. Next we take the polar coordinates (r, cp). If (r, (p) m>- (r cos <p, r sin <p ) 
is regarded as an embedding, we find the induced metric, 

g = dr ® dr + r 2 d(p ® dcp. (7.11) 

Let V = V r d/dr + V^d/dcp be a vector defined at (r, <p). If we parallel transport 
this vector to ( r + A r,(p), we have a new vector V = V' d/dr |( r+ Ar </>) + 
V ^ 9/90| ( r+ Ar.<j>) (figure 7.1(a)). Note that V r — V cost? and V & — V(sin 6/r), 
where V = x/g< V . V) and 0 is the angle between V and d/dr. Then we have 
V r = V r and 

~ r A »* 

= V t ~ v*. 

r + A r r 


By comparing these components with (7.9), we easily find that 


r \ r = 0 


T r r<j> = 0 


= 0 r% = -. (7.12a) 


Similarly, if V is parallel transported to (r, (p + A <p), it becomes 


V = W 


9 

dr 


V^- 


(r.<p+A<l>) 





Figure 7.1. V is a vector V parallel transported to (a) ( r + A r, (j > ) and ( b ) ( r , <j> + A <j>). 


where 

V' — V cos(# — A 0) ~ V cos 0 + V sin# A — V r + V^rAfi 


and 

sin(# — A d>) sin# A <b . ..A <p 

r r r r 

(figure 7. 1 (Z?)). Then we find 

rv = o r^ = -r rV = ^ r% = o. (7. 12 b) 

Note that the T satisfy the symmetry r x ^ v = r x vil . It is also implicitly assumed 
that the norm of a vector is invariant under parallel transport. A rule of parallel 
transport which satisfies these two conditions is called a Levi-Civita connection, 
see section 7.4. Our intuitive approach leads us to the formal definition of the 
affine connection. 


7.2.2 Affine connections 

Definition 7.2. An affine connection V is a map V : X(M) x X(M) -> X(M), or 
(X, Y) i->- Vv Y which satisfies the following conditions: 

V X (Y + Z) = V X Y + V X Z (7.13a) 

V (X+Y) Z=V X Z+V Y Z (7.13b) 

= /VzX (7.13c) 

X x (fY) = X[f]Y + fV x Y (7.13d) 


where / e 9T/W) and X.Y,Z e X(M). 


Take a chart ( U , (p) with the coordinate x — (pip) on M, and define m? 
functions T A v/i called the connection coefficients by 

— ^e v e [i — O.T v/i (7.14) 

where {e p } = {3/3.r M } is the coordinate basis in T p M. The connection 
coefficients specify how the basis vectors change from point to point. Once the 
action of V on the basis vectors is defined, we can calculate the action of V on 
any vectors. Let V — V^cp, and W — W v e v be elements of T p ( M ) . Then 

VyW = V^V^(W v e v ) = V^ie^W^e, + W v V e ^e v ) 

SdW 1 , \ 

= v ‘ (^r + lv r “')“■■ <7I5> 

Note that this definition of the connection coefficient is in agreement with the 
previous heuristic result (7.10). By definition, V maps two vectors V and W to a 
new vector given by the RHS of (7.15), whose kth component is p W' where 

v,ff A = ^ + rVr. (7.16) 

Note that V /( is the /,th component of a vector V /( W = V ; , W k e\ and should 
not be confused with the covariant derivative of a component W k . V y W is 
independent of the derivative of V, unlike the Lie derivative CyW — [V, W], 
In this sense, the covariant derivative is a proper generalization of the directional 
derivative of functions to tensors. 


7.2.3 Parallel transport and geodesics 

Given a curve in a manifold M, we may define the parallel transport of a vector 
along the curve. Let c : (a, b) —> M be a curve in M. For simplicity, we assume 
the image is covered by a single chart (U, cp) whose coordinate is x = (pip). Let 
X be a vector field defined (at least) along cit), 

X\ c{t) = X»(c(t))e ll \ c(t) (7.17) 

where e p = d/dx ^ . If X satisfies the condition 

VyX = 0 for any t e ia, b) (7.18a) 


X is said to be parallel transported along c(f) where V = d/dr= 
(d.r^ (c(f))/d?)e |U |c(«) is the tangent vector to cit). The condition (7.18a) is 
written in terms of the components as 


d7G l 


+ 


dx' 


vX “ 


( -^lx k = 0 . 


dt 


dt 


(7.18b) 



If the tangent vector V (t) itself is parallel transported along c(t), namely if 

V v V = 0 (7.19a) 

the curve c(t) is called a geodesic. Geodesics are, in a sense, the s traightest 
possible curves in a Riemannian manifold. In components, the geodesic 
equation (7.19a) becomes 


d 2 x^ „ dx y dx A 
dt 2 dr dr 


= 0 


(7.19b) 


where [x 11 } are the coordinates of c(r). We might say that (7.19a) is too strong to 
be the condition for the straightest possible curve, and instead require a weaker 
condition 

V v V = fV (7.20) 

where / e T ( M ) . ‘Change of V is parallel to V’ is also a feature of a straight 
line. However, under the reparametrization t —*■ t' , the component of the tangent 
vector changes as 

dx^ 1 dr dx ^ 

dr dr' dr 

and (7.20) reduces to (7.19a) if t 1 satisfies 


dV _ dr' 
dr 2 f dr 


Thus, it is always possible to reparametrize the curve so that the geodesic equation 
takes the form (7.19a). 


Exercise 7.3. Show that (7. 19b) is left invariant under the affine reparametrization 
r — > at + b (a, b e M). 


7.2.4 The covariant derivative of tensor fields 

Since Vx has the meaning of a derivative, it is natural to define the covariant 
derivative of / e 7(M ) by the ordinary directional derivative: 

Vx/ = *[/]■ (7.21) 

Then (7.13d) looks exactly like the Leibnitz rule, 

Vx(/L) = (Vx/)F + /VxT. (7.13d') 

We require that this be true for any product of tensors. 


Vx(7) <g> 73) = (Vx7’i) ® 73 + T\ ® (Vx73) 


(7.22) 



where 7) and Ti are tensor fields of arbitrary types. Equation (7.22) is also true 
when some of the indices are contracted. With these requirements, we compute 
the covariant derivative of a one-form o> e Q l ( M ). Since {ax, Y) e ‘J(M ) for 
Y eX(M), we should have 


X[(a>, F>] = V x [(ax, F>] = (V x ax, Y) + (ax, V X Y). 


Writing down both sides in terms of the components we find 
(Via), = x»d„co v - x»r x ^o) X . 

In particular, for X = c /; , we have 

(y„CO) V — 3 ^yOXX- 

For ax = dx v , we obtain (cf (7.14)) 

w fl dx v = -r v ^dx\ 

It is easy to generalize these results as 


M...k p _ X\...X p kX 2 ...X p 

v V l ll\...flq — u V L ll\...llq i 1 u/rtr 




+ 


I il...ip-1/C 

' 1 VK‘tn...n q 


M- x p 


- r* 




1 V/i 1 '(f/i 2 .../U< z 

Exercise 7.4. Let g be a metric tensor. Verify that 

( X v g)xfi — dv8Xfi r vXSicfi r v/j,§Xk- 


(7.23) 


(7.24) 


(7.25) 


(7.26) 


(7.27) 


7.2.5 The transformation properties of connection coefficients 

Introduce another chart (V, ifr) such that U fl V / 0, whose coordinates are 
y = i fr(p). Let {e^} = {3/3.^} and {f a } = { 3/3y Q ' } be bases of the respective 
coordinates. Denote the connection coefficients with respect to the y-coordinates 
by r 01 p y . The basis vector f a satisfies 

V/.//J = r r a pfy. 

If we write f a = (dx^ /dy a )e pL , the LHS becomes 

/ dx^ \ d 2 x^ dx x dx ^ 

V f a /P ~ V fc [dp' e ^j ~ dyad yfi e V + 

/ 3 2 .r y dx x dx^ v 

l dy a dyP 3 y a dyP ^ 



(7.28) 



Since the RHS of (7.28) is equal to T Y a p(dx v /d y Y )e v , the connection coefficients 
must transform as 


dx x dx" dy Y v d 2 x v dy Y 

3 y a 3 yP dx v ^ dy a dyP dx v 


(7.29) 


The reader should verify that this transformation rule indeed makes Vx F a vector, 
namely 

X a (d a Y Y + f *apY p )fy = X x (d X Y V + r v Xll Y v )e v . 

In the literature, connection coefficients are often defined as objects which 
transform as (7.29). From our viewpoint, however, they must transform according 
to (7.29) to make VxF independent of the coordinate chosen. 


Exercise 7.5. Let T be an arbitrary connection coefficient. Show that T x ^ v + t x ^, v 
is another connection coefficient provided that t' llv is a tensor field. Conversely, 
suppose r'> and T x ^ are connection coefficients. Show that T A |ttv — T^y is a 
component of a tensor of type (1,2). 


7.2.6 The metric connection 

So far we have left T arbitrary. Now that our manifold is endowed with a metric, 
we may put reasonable restrictions on the possible form of connections. We 
demand that the metric g ^ be covariantly constant, that is, if two vectors X 
and Y are parallel transported along any curve, then the inner product between 
them remains constant under parallel transport. [In example 7.1, we have already 
assumed this reasonable condition.] Let V be a tangent vector to an arbitrary 
curve along which the vectors are parallel transported. Then we have 

0 = Vy[g(X, F)] = V K [(V K g)(X, Y ) + g(V K X, F) + g(X, V e F)] 

= v K x^ l y v (y K g), JLV 

where we have noted that V t X = V K Y — 0. Since this is true for any curves and 
vectors, we must have 

(Xk8)^v = 0 (7.30a) 

or, from exercise 7.4, 

<hg,u, - r K x„g KV - r \ v g Kfl = 0. (7.30b) 

If (7.30a) is satisfied, the affine connection V is said to be metric compatible or 
simply a metric connection. We will deal with metric connections only. Cyclic 
permutations of (A, /r, v) yield 


3/^gyA L fj,vgKX T [ixgicv — 0 

9u<?A/a r vX§kh r v^SkX — o. 


(7.30c) 

(7.30d) 



The combination —(7.30b) + (7.30c) + (7.30d) yields 

dxS/iv T - d/igvX T - T" T Xjigicv T - T Xv^k/jl ~ 2T (/j / v)8kX = 0 (7.31) 

where T K Xfl = 2T\ m = r K x „ - T\ x and r K (#tu) = ±(r\„ + T\ v ). The 
tensor T K Xfl is anti-symmetric with respect to the lower indices T K Xfl = —T K tlX 
and called the torsion tensor, see exercise 7.6. The torsion tensor will be studied 
in detail in the next section. Equation (7.31) is solved for ( /IV) to yield 

r ' C( ^ ) ={^} + ^(7’rV + Vj (7-32) 

where { * v ) are the Christoffel symbols defined by 

^ y | = \g KX (d/igvX + d v g, lX - 3 xg^) . (7.33) 

Finally, the connection coefficient F is given by 

p/C _i_ p/C 

1 iiv — 1 (/xv) ' 1 [/x v] 

= + ^ + r /, + r V (7-34) 

The second term of the last expression of (7.34) is called the contorsion, denoted 
by K K ^ v : 

K\ v = \{T\ V + T„\ + T v \). (7.35) 

If the torsion tensor vanishes on a manifold M, the metric connection 
V is called the Levi-Civita connection. Levi-Civita connections are natural 
generalizations of the connection defined in the classical geometry of surfaces, 
see section 7.4. 

Exercise 7.6. Show that T K obeys the tensor transformation rule. [Hint: Use 
(7.29).] Show also that K K [/XV ] = \T K ^ and K ki , v = -K VilK where K KI1V = 

8kxK fix;. 

7.3 Curvature and torsion 
7.3.1 Definitions 

Since F is not a tensor, it cannot have an intrinsic geometrical meaning as 
a measure of how much a manifold is curved. For example, the connection 
coefficients in example 7.1 vanish if the Cartesian coordinate is employed while 
they do not in polar coordinates. As intrinsic objects, we define the torsion tensor 



T : X(Af) <g> X(M) -» X(M) and the Riemann curvature tensor (or Riemann 
tensor) R : X(M) ® X(M) <g> X(M) -> X(M) by 

T(X, Y) = V X F - V y Z - [X, 7] (7.36) 

fl(X, F, Z) = V x V y Z - V F V X Z - V [X ,F]Z. (7.37) 

It is common to write R(X , F)Z instead of R(X, F, Z), so that R looks like an 
operator acting on Z. Clearly, they satisfy 

T(X, F) = —T (F, X), R(X, F)Z = -R(Y, X)Z. (7.38) 

At first sight, T and R seem to be differential operators and it is not obvious that 
they are multilinear objects. We prove the tensorial property of R, 

R(fX, gY)hZ = fV x {gV Y (hZ)} - g V Y {fV x (hZ)} - fX[g]V Y (hZ ) 

+ gY[f]X x (hZ) - fgX [X ,Y](hZ) 

= fgVx{Y[h]Z + hV Y Z } - gfX Y {X[h]Z + hX x Z } 
-fg[X, Y][h]Z — f gh'V[ X , Y ]Z 
= fgh{V x V Y Z - VyVxZ - V[x,f]Z} 

= fgh R(X, F)Z. 

Now it is easy to see that R satisfies 

R(X, F)Z = X x Y»Z v R(e x , e^)e v (7.39) 

which verifies the tensorial property of R. Since R maps three vector fields to a 
vector field, it is a tensor field of type (1,3). 

Exercise 7.7. Show that T defined by (7.36) is multilinear, 

T(X, F) = X /x F 1 '7’(e /x , e v ) (7.40) 

and hence a tensor field of type (1,2). 

Since T and R are tensors, their operations on vectors are obtained once their 
actions on the basis vectors are known. With respect to the coordinate basis {e^} 
and the dual basis {d.r^}, the components of these tensors are given by 

T X fj, v = (d.r\ T{e e v )) = (d.r\ - V,,^) 

= (dx x , Y\ v e tl - = r x M y - r k v „ (7.41) 

and 


R K Xnv = (dx K , Rie^, e v )ex) = (dx K , X^V v ex - V„ V /t e A ) 

= (dx\ v^r^xer,) - v^rv*,)) 

= (dx K , Onr\x)ei + r\ k r ^ ~ {d v Y\x)e„ - 

= 3^r% A - a y r> + r" vA r% ); - rv^V (7.42) 


We readily find (cf (7.38)) 


r A r T'A 

/XV — 1 V/jL 


K 

k/lV — 


-R K 


R‘ 


kv/x- 


(7.43) 



C' 



Figure 7.2. It is natural to define V parallel transported along a great circle if the angle V 
makes with the great circle is kept fixed. If V at p is parallel transported along great circles 
C and C ' , the resulting vectors at q point in opposite directions. 



Figure 7.3. A vector Vq at p is parallel transported along C and C' to yield Vp(r) and 
Vpt(r) at r. The curvature measures the difference between two vectors. 


7.3.2 Geometrical meaning of the Riemann tensor and the torsion tensor 

Before we proceed further, we examine the geometrical meaning of these tensors. 
We consider the Riemann tensor first. A crucial observation is that if we parallel 
transport a vector V at p to q along two different curves C and C' , the resulting 
vectors at q are different in general (figure 7.2). If, however, we parallel transport 
a vector in a Euclidean space, where the parallel transport is defined in our 
usual sense, the resulting vector does not depend on the path along which it 
has been parallel transported. We expect that this non-integrability of parallel 
transport characterizes the intrinsic notion of curvature, which does not depend 



on the special coordinates chosen. Let us take an infinitesimal parallelogram 
pqrs whose coordinates are {x ,l \, { x p + e ^}, { x p + e p + 8 P } and { x p + <5^ } 
respectively, s 1 ' and 8 1 ' being infinitesimal (figure 7.3). If we parallel transport 
a vector Vo € T p M along C — pqr , we will have a vector Vc(r) e T r M. The 
vector Vo parallel transported to q along C is 

V'c(q) = V p - V^ VK {p)s v . 

Then v£ (r) is given by 

Vg(r) = V p (q) - V£(q)r fl VK (q)8 v 

= Vq - v ( «r'\ K e v - [Vo - vgr K (p (p)e s ] 
x [r p vlc (p) + d x r» VK (p)e l ]8 v 
- v p - Vq r^ VK (p)s v - v 0 *T P VK (P)S V 

- vtft&r p vA P ) - r p x K (p)r p vp (p)]e^8 v 

where we have kept terms of up to order two in e and 8. Similarly, parallel 
transport of Vo along C' — psr yields another vector Vc'(r) e T r M, given by 

v£,(r) — V p - V ( «r<\Ap)S v - Vo'T » VK (p)e v 

- V 0 K [d v r p x<(p) - rP VK {p)T p x P {p)]e k 8 v . 

The two vectors at r differ by 

Vc'(r) - Vc(r) = V^[d x V> J VK ( P ) - dv^ip) 

- r p xAp)^ vp ( P ) + r p VK {p)r p Xp ( P )]e x 8 v 

= Vq R p kXv s x 8 v . ( 7 . 44 ) 

We next look at the geometrical meaning of the torsion tensor. Let p e M 
be a point whose coordinates are {x 11 }. Let X = and Y = 8 p e p be 

infinitesimal vectors in T p M. If these vectors are regarded as small displacements, 
they define two points q and s near p, whose coordinates are {x p + s 1 ' } and 
{x 11 + iV 4 } respectively (figure 7.4). If we parallel transport X along the line ps , 
we obtain a vector sr i whose component is 8 11 — s x T p, vX 8 v . The displacement 
vector connecting p and r\ is 

pr\ — ps + sr\ = 8 P + — T p vX s x S v . 

Similarly, the parallel transport of 8 1 ' along pq yields a vector 

pr 2 — pq + qn = e p + 8 P - Y p Xv e x 8 v . 


In general, n and n do not agree and the difference is 

nn = pn - pn = (r^ - r^ v ) e V = t p vX s x 8 v . 


( 7 . 45 ) 



Figure 7.4. The vector qrj (fr; ) is the vector ps ( pq) parallel transported to q (s). In 
general, r\ 7^ ri and the torsion measures the difference rjr \ . 


Thus, the torsion tensor measures the failure of the closure of the parallelogram 
made up of the small displacement vectors and their parallel transports. 

Example 7.2. Suppose we are navigating on the surface of the Earth. We define a 
vector to be parallel transported if the angle between the vector and the latitude is 
kept fixed during the navigation. [ Remarks : This definition of parallel transport 
is not the usual one. For example, the geodesic is not a great circle but a straight 
line on Mercator’s projection. See example 7.5.] Suppose we navigate along 
a small quadrilateral pqrs made up of latitudes and longitudes (figure 7.5(a)). 
We parallel transport a vector at p along pqr and psr, separately. According 
to our definition of parallel transport, two vectors at r should agree, hence the 
curvature tensor vanishes. To find the torsion, we parametrize the points p,q,r 
and s as in figure 7.5 (b). We find the torsion by evaluating the difference between 
pr 1 and pn as in (7.45). If we parallel transport the vector pq along ps, we 
obtain a vector sr\, whose length is R sin 0d0. However, a parallel transport 
of the vector ps along pq yields a vector qrj — qr. Since sr has a length 
R sin(0 — d0) d <p — R sin 6 d<p ~ R cos 6 dd d 0, we find that r\rs has a length 
R cos0 dO dtp. Since r 1 ri is parallel to —3/30, the connection has a torsion 
t%, see (7.45). From = R 2 sin 2 0, we find that nr 2 has components 
(0, — cot 9 dd d(f>) . Since the 0-component of n ^'2 is equal to g^dOdcf), we 
obtain T^g^ — — cot0. 

Note that the basis {3/30, 3/30} is not well defined at the poles. It is known 
that the sphere S 2 does not admit two vector fields which are linearly independent 
everywhere on S 2 . Any vector field on S 2 must vanish somewhere on S 2 and 


(0 - d0, <p + dp) 


(a) 



W 

(6 — dO, ip) 



Figure 7.5. (a) If a vector makes an angle a with the longitude at p, this angle is kept fixed 
during parallel transport. ( b ) The vector jtj (qrj) is the vector pq (ps) parallel transported 
to s (q). The torsion does not vanish. 


hence cannot be linearly independent of the other vector field there. If an m- 
dimensional manifold M admits m vector fields which are linearly independent 
everywhere, M is said to be parallelizable. On a parallelizable manifold, we 
can use these m vector fields to define a tangent space at each point of M. A 
vector V p e T p M is defined to be parallel to V q e T q M if all the components of 
V p at T p M are equal to those of V q at T q M. Since the vector fields are defined 
throughout M, this parallelism should be independent of the path connecting p 
and q , hence the Riemann curvature tensor vanishes although the torsion tensor 
may not in general. For S m , this is possible only when in — 1,3 and 7, which is 
closely related to the existence of complex numbers, quaternions and octonions, 
respectively. For definiteness, let us consider 


S 3 


(x\x 2 ,x 3 ,x 4 ) 


J >') 2 


i = 1 



embedded in (K 4 ,5). Three orthonormal vectors 

ei(jc) = (— x 2 , x 1 , —x 4 , x 3 ) 
e 2 (x) = (— x 3 , x 4 , x 1 , — x 2 ) 
e 3 (jc) = (—x 4 , —x 3 , x 2 , x 1 ) 


(7.46) 


are orthogonal to x — (x 1 , x , x , x 4 ) and linearly independent everywhere 
on S 3 , hence define the tangent space T X S 3 . Two vectors V i (x) and V 2 (y) 


are parallel if V \(x) = c l c, (x ) and Vj(y) = ^c l e, (y). The connection 
coefficients are computed from (7.14). Let eei(x) be a small displacement 
under which x = (x 1 , x 2 , x 3 ,x 4 ) changes to x' = x + eei(x) = {x 1 — 
ex 2 , x 2 + ex 1 , x 3 — ex 4 , x 4 + ex 3 }. The difference between the basis vectors 
at x and x' is e 2 (x' ) — e 2 (x) = (— x 3 — ex 4 , x 4 + ex 3 , x 1 — ex 2 , — x 2 — ex 1 ) — 
(— x 3 , x 4 , x 1 , — x 2 ) = — e« 3 (x) = eV^ne^ix), hence T 3 i 2 = — 1 , T 1 12 = 
T 2 i 2 = 0. Similarly, F 3 2 i = 1 hence we find 7’ 3 i2 = —2. The reader 
should complete the computation of the connection coefficients and verify that 
T x n „ = —2 (+2) if (A ,/rv) is an even (odd) permutation of (123) and vanishes 
otherwise. 

Let us see how this parallelizability of S 3 is related to the existence of 
quaternions. The multiplication rule of quaternions is 

(x\x 2 ,x 3 ,x 4 )- (y\y 2 ,y\y 4 ) 

= (x V - x 2 y 2 - x 3 y 3 - x 4 y 4 , x 1 y 2 + x 2 y 3 + x 3 y 4 - x 4 y 3 , 
x 1 y 3 - x 2 / + X 3 / + x 4 y 2 , x 1 / + x 2 y 3 - x 3 y 2 + xN 1 ). (7.47) 

S 3 may be defined by the set of unit quaternions 

S 3 = {(x\x 2 ,x 3 ,x 4 )|x x = 1} 

where the conjugate of x is defined by x — (x 1 , — x 2 , — x 3 , — x 4 ). According to 
(7.46), the tangent space at xo = (1, 0, 0, 0) is spanned by 

ci =(0, 1,0,0) e 2 = (0,0, 1,0) c 3 = (0,0,0, 1). 

Then the basis vectors (7.46) of the tangent space at x — (x 1 , x 2 , x 3 , x 4 ) are 
expressed as the quaternion products 

C 1 (x ) = C 1 • X C 2 (x) = C 2 'X C 3 (x) = C 3 -X. (7.48) 

Because of this algebra, it is always possible to give a set of basis vectors at an 
arbitrary point of S 3 once it is given at some point, xo = (1, 0, 0, 0), for example. 

By the same token, a Lie group is parallelizable. If the set of basis vectors 
{V\, , V m } at the unit element e of a Lie group G is given, we can always find 
a set of basis vectors of T g G by the left translation of { Vj , } (see section 5.6), 

{V x ,...,V n }^{X x \ g ,...,X n \ g }. (7.49) 

7.3.3 The Ricci tensor and the scalar curvature 

From the Riemann curvature tensor, we construct new tensors by contracting the 
indices. The Ricci tensor Ric is a type (0, 2) tensor defined by 


Ric(X, Y ) = (dx^, Rie^, Y)X) 


(7.50a) 



whose component is 


Ric^v = Ricle^, e v ) = R 1 ^ v . (7.50b) 

The scalar curvature 1Z is obtained by further contracting indices, 

U = g^'Ricie^ e v ) = g^Ric^. (7.51) 

7.4 Levi-Civita connections 
7.4.1 The fundamental theorem 

Among affine connections, there is a special connection called the Levi-Civita 
connection, which is a natural generalization of the connection in the classical 
differential geometry of surfaces. A connection V is called a symmetric 
connection if the torsion tensor vanishes. In the coordinate basis, connection 
coefficients of a symmetric connection satisfy 

tV = rV (7.52) 

Theorem 7.1. (The fundamental theorem of (pseudo-)Riemannian geometry) 

On a (pseudo- )Riemannian manifold (M, g), there exists a unique symmetric 
connection which is compatible with the metric g. This connection is called the 

Levi-Civita connection. 

Proof. This follows directly from (7.34). Let V be an arbitrary connection such 
that 

pK ^ I I Jfk 

1 llv — i i av 

l/xvj 

where { * v } is the Christoffel symbol and K the contorsion tensor. It was shown 
in exercise 7.5 that T K = T K + t K is another connection coefficient if t is 
a tensor field of type (1, 2). Now we choose t K = — K K so that 

r V = = ^{dugk v + d v g - dxg^). (7.53) 

By construction, this is symmetric and certainly unique given a metric. □ 

Exercise 7.8. Let V be a Levi-Civita connection. 

(a) Let f e 3"(M). Show that 

V M V v / = V y V„/. (7.54) 

(b) Let co e £2 l (M). Show that 


d co = (V„co) v dx /l A dx v . 


(7.55) 



Figure 7.6. On a surface M, a vector V p e T p M is defined to be parallel to V q e T q M if 
the projection of Vq onto T p M is parallel to V p in our ordinary sense of parallelism in R 2 . 


(c) Let a> e ( M) and let U e X(M) be the corresponding vector field: 

— g^ v co v . Show that, for any V e X(M), 

g(V x U, V) = (Vjrfi), V). (7.56) 

Example 7.3. 

(a) The metric on M 2 in polar coordinates is g — dr ® dr + r 2 d (p <g> dtp. 

The non-vanishing components of the Levi-Civita connection coefficients 
are ^ = r -1 and V ^ = —r. This is in agreement with the 

result obtained in example 7.1. 

(b) The induced metric on S 2 is g = Ad ® dO + sin 2 0 dip ® dp. The non- 
vanishing components of the Levi-Civita connection are 

r %0 = - cos <9 sin <9 = cot0. (7.57) 

7.4.2 The Levi-Civita connection in the classical geometry of surfaces 

In the classical differential geometry of surfaces embedded in TO 3 , Levi-Civita 
defined the parallelism of vectors at the nearby points p and q in the following 
sense (figure 7.6). First, take the tangent plane at p and a vector V p at p, which 
lies in the tangent plane. A vector V q at q is defined to be parallel to V p if the 
projection of V q to the tangent plane at p is parallel to V p in our usual sense. 
Now take two points q and s near p as in figure 7.7 and parallel transport the 
displacement vectors pq along ps and ps along pq. If the parallelism is defined 
in the sense of Levi-Civita, the displacement vectors projected to the tangent 
plane at p form a closed parallelogram, hence this parallelism has vanishing 
torsion. As has been proved in theorem 7.1, there exists a unique connection 
which has vanishing torsion, which generalizes the parallelism defined here to 
arbitrary manifolds. 



Figure 7.7. If the parallelism is defined in the sense of Levi-Civita, the torsion vanishes 
identically. 

7.4.3 Geodesics 

When the Levi-Civita connection is employed, we can compute the connection 
coefficients, Riemann tensors and many relations involving these by simple 
routines. Besides this simplicity, the Levi-Civita connection provides a geodesic 
(defined as the straightest possible curve) with another picture, namely the 
shortest possible curve connecting two given points. In Newtonian mechanics, 
the trajectory of a free particle is the straightest possible as well as the shortest 
possible curve, that is, a straight line. Einstein proposed that this property should 
be satisfied in general relativity as well; if gravity is understood as a part of the 
geometry of spacetime, a freely falling particle should follow the straightest as 
well as the shortest possible curve. [Remark: To be precise, the shortest possible 
curve is too strong a condition. As we see later, a geodesic defined with respect 
to the Levi-Civita connection gives the local extremum of the length of a curve 
connecting two points.] 

Example 7.4. In a flat manifold (TR m , S ) or (M m , rj), the Levi-Civita connection 
coefficients T vanish identically. Hence, the geodesic equation (7.19b) is easily 
solved to yield x ,L — A^t + B where A M and B [l are constants. 

Exercise 7.9. A metric on a cylinder S l x IR is given by g — dtp <g> dtp + dj <g> dz, 
where (p is the polar angle of .S' 1 and z the coordinate of ffi. Show that the geodesics 
given by the Levi-Civita connection are helices. 

The equivalence of the straightest possible curve and the local extremum of 
the distance is proved as follows. First we parametrize the curve by the distance s 
along the curve, x 11 — x IL (s). The length of a path c connecting two points p and 
q is 

m = fds = J y g llv .x l i J -x lv ds (7.58) 

where x' 11 = dx 11 /ds. Instead of deriving the Euler-Lagrange equation from 
(7.58), we will solve a slightly easier problem. Let F = \g llv x ,,l x ,v and write 



(7.58) as 7(c) = f c L(F)ds. The Euler-Lagrange equation for the original 
problem takes the form 


cE \ dx' x ) dx x 


Then F — L~ / 2 satisfies 


d / dF \ dF _ [ d f dL\ dL 1 dL dL _ dL dL 

d^ \3x ,A / 3x A d.? \3x ,A / 3 x a _ 3x ,A d.? 3x ,A d.? * 

The last expression vanishes since L = 1 along the curve; dL/ds — 0. Now 
we have proved that F also satisfies the Euler-Lagrange equation provided that L 
does so. We then have 


-igx^x'^) 


„tu,jv 


2 3x A 


- Hibasur'v , d 2 x^ lBg„ v 

x x dj2 2 dx x x 


= g A/i- 


9gx/i dgiv 
dx v 3x^ 


dx M dx v 

— — = °. (7.61) 
d.? d.v 


If (7.61) is multiplied by g , we reproduce the geodesic equation (7. 19b). 

Having proved that L and F satisfy the same variational problem, we take 
advantage of this to compute the Christoffel symbols. Take S 2 . for example. F is 
given by J?(0 /2 + sin 2 0<p ' 2 ) and the Euler-Lagrange equations are 


d 2 <p dd> d 9 

d^ + 2cot0 d7d7 


(7.62a) 


(7.62b) 


It is easy to read off the connection coefficients 
F't’w = T% = cot0, see (7.57). 


= — sin 6 cos 0 and 


Example 7.5. Let us compute the geodesics of S 2 . Rather than solving the 
geodesic equations (7.62) we find the geodesic by minimizing the length of a 
curve connecting two points on S 2 . Without loss of generality, we may assign 
coordinates (0i,0o) and (02, 4>o) to these points. Let cp — </>(0) be a curve 
connecting these points. Then the length of the curve is 




which is minimized when dip/dO = 0, that is <p = tp o- Thus, the geodesic is a 
great circle (0, ipo), 6\ < 0 < 02. [ Remark : Solving (7.62) is not very difficult. 
Let 0 = 6((p ) be the equation of the geodesic. Then 


dO _ de dip d 2 0 _ d 2 9 /d0\ 2 dd d 2 (P 


d.? dip ds d.? 2 dip 2 \ ds J dip d.? 2 

Substituting these into the first equation of (7 .62), we obtain 

d 2 d ( d ip\ d 0 dr<p ( dip \ 

— T ( — | + -sin0cos0 ( — | =0. 

dip 2 \ dj / d(/> d^ 2 \ d.? 

The second equation of (7.62) and (7.64) yields 


d 2 0 / d0 

— - — 2cot 0 ( — 
dip 2 \dip 


If we define f(0) = cot0, (7.65) becomes 

d 2 / 


sin0 cos0 = 0. 


d 0 2+/ -° 

whose general solution is /(0) = cot0 = A cos ip + B simp or 
A sin 0 cos ip + B sin 0 sin ip — cos 0 = 0. 


(7.64) 


(7.65) 


(7.66) 


Equation (7.66) is the equation of a great circle which lies in a plane whose normal 
vector is (A, B, —1).] 


Example 7.6. Let U be the upper half-plane U = {(x, y)|y > 0} and introduce 

the Poincare metric 

dx <g> dx + dv ® dy 

g= 5“- • (7 ' 67) 

y 1 

The geodesic equations are 

x" - —x'y' = 0 (7.68a) 

y 

y"--[x ,2 + 3v , 2 ]=0 (7.68b) 

y 

where x' = dx /d.v etc. The first equation of (7.68) is easily integrated, if divided 
by x\ to yield 

x' _ 1 

~y 2 ~ ~R 


(7.69) 




Figure 7.8. Geodesics defined by the Poincare metric in the upper half-plane. The geodesic 
has an infinite length. 


where R is a constant. Since the parameter .v is taken so that the vector (. x ' , y') 
has unit length, it satisfies (x a -\- y' 2 ) / y 2 — 1. From (7.69), this becomes 
y 2 / R 2 + (y'/y ) 2 = 1 or 


ds = 


dy 


d t 


ys/l - y 2 /R 2 sin r 

where we put y = R sin r. Equation (7.69) then becomes 

,,2 


x' — — — R sin 2 t. 
R 


Now x is solved for t to yield 


f . f dx ds 

J x'ds = J — — 

/ 


dr 


= J R sinf dr = — R cos t + vo- 

Finally, we obtain the solution 

x = — R cos r + .ro y = R sin t (y > 0) 


(7.70) 


which is a circle with radius R centred at (xo, 0). Maximally extended geodesics 
are given by 0 < t < tc (figure 7.8) whose length is infinite, 


r r~ e ds r~ s i 

/ = / ds = / — dr = / 

J Jo+s dr J 0+£ sinr 


dr 


/0+s 

1 1 +cosr n ~ e 

— log 

2 1 - cos r 0+£ 


£ — > 0 


->oo. 


7.4.4 The normal coordinate system 

The subject here is not restricted to Levi-Civita connections but it does take an 
especially simple form when the Levi-Civita connection is employed. Let c{t) be 



a geodesic in ( M , g ) defined with respect to a connection V, which satisfies 


c(0) = p. 


= X = X% e T p M 

p 


(7.71) 


where [e p | is the coordinate basis at p. Any geodesic emanating from p is 
specified by giving X e T p M. Take a point q near p. There are many geodesics 
which connect p and q. However, there exists a unique geodesic c q such that 
c 9 (l) = q. Let X q e T p M be the tangent vector of this geodesic at p. As 
long as q is not far from p, q uniquely specifies X q = X q e p e T p M and 
<p : q -> Xq serves as a good coordinate system in the neighbourhood of p. 
This coordinate system is called the normal coordinate system based on p with 
basis {e^}. Obviously (pip) = 0. We define a map EXP : T p M -> M by 
EXP : X q i-> q . By definition, we have 


<p(EXPX%e p ) = X%. (7.72) 

With respect to this coordinate system, a geodesic c(t ) with c(0) = p and 
c(l) = q has the coordinate presentation 


cpidt)) = = X%t 


(7.73) 


where X q are the normal coordinates of q. 

We now show that Levi-Civita connection coefficients vanish in the normal 
coordinate system. We write down the geodesic equation in the normal coordinate 
system, 


d 2 X ,t „ „ dX y dX 1 „ „ , 

o = -^ 2 - + r» vk (x*t) = r\ x {x K q t)x v q x x q . (7.74) 

Since r M v ^(p)X y X^ = 0 for any X v q at p for which t — 0, we find \" 11 v d p) + 
T ^Xvip) — 0- Since our connection is symmetric we must have 


r» vX (p) = 0 . (7.75) 

As a consequence, the covariant derivative of any tensor t in this coordinate 
system takes the extremely simple form at p, 

Vyr; = X[r;]. (7.76) 

Equation (7.75) does not imply that r /; l; ; vanishes at q ( ^ /? ) . In fact, we 
find from (7.42) that 


R\uv(P) = d p r\ x (p) - d v T\x{p) (7.77) 


hence d p r K vX {p) ^ 0 if R K xiiv(p ) # 0. 



7.4.5 Riemann curvature tensor with Levi-Civita connection 


Let V be the Levi-Civita connection. The components of the Riemann curvature 
tensor are given by (7.42) with 


r A — 

1 /XI’ — 


K 

fXV 


while the torsion tensor vanishes by definition. Many formulae are simplified if 
the Levi-Civita connections are employed. 


Exercise 7.10. 

(a) Let g — dr ® dr + r 2 (d0 <g> dO + sin 2 6 d(f> <g> d </>) be the metric of (M 3 , S), 
where 0 < 0 < jt, 0 < <fi < 2 tt. Show, by direct calculation, that all the 
components of the Riemann curvature tensor with respect to the Levi-Civita 
connection vanish. 

(b) The spatially homogeneous and isotropic universe is described by the 

Robertson- Walker metric, 

g = —dr<g>dr+a 2 (t) ( — ® ^ + r 2 ( d6 <g> d6 + sin 2 0 d</> <g> d0)^) (7.78) 


where k is a constant, which may be chosen to be — 1, 0 or + 1 by a suitable 

rescaling of r and 0 <6 < re, 0 < <p < 2ir. If k = +1, r is restricted to 

0 < r < 1. Compute the Riemann tensor, the Ricci tensor and the scalar 
curvature. 

(c) The Schwarzschild metric takes the from 

( 2M \ 

g — — I 1 I dr ® dr 

H dr <g> dr + r 2 (d0 <g> dd + sin 2 6 dep ® d(f>) (7.79) 

r 

where 0 < 2 M < r, 0 < 0 < tt. 0 < (/) < Compute the Riemann 
tensor, the Ricci tensor and the scalar curvature. [ Remark : The metric (7.79) 
describes a spacetime of a spherically symmetric object with mass M.\ 


Exercise 7.11. Let R be the Riemann tensor defined with respect to the Levi- 
Civita connection. Show that 

R = I ( _ dW _ 9 2 g/xr \ 

hX ^ 1 ' 2 y3x A 3x v dx K dx v dx x dx^ dx lc dx fl J 

K\± r\ v - rf e „r V) 



where R K x^v = gK^R 1 * X/xv- Verify the following symmetries, 


RkXw = -RkXvh (cf (7.43)) (7.80a) 

RkXiiv = —Rxicfiv (7.80b) 

RkX/iv = R^vkX (7.80c) 

Ric, lv = Ric v/l . (7.80d) 


Theorem 7.2. (Bianchi identities) Let R be the Riemann tensor defined with 
respect to the Levi-Civita connection. Then R satisfies the following identities: 

R(X, Y)Z + R(Z, X)Y + R(Y. Z)X = 0 

(the first Bianchi identity) (7.81a) 

(V X R)(Y. Z)V + (V Z R)(X, Y)V + (VyR)(Z, X)V = 0 

(the second Bianchi identity ) . (7.81b) 

Proof. Our proof follows Nomizu (1981). Define the symmetrizor & by 
6{/(X, Y. Z )} = /(X , Y, Z) + f(Z, X, Y) + f(Y , Z, X). Let us prove the 
first Bianchi identity &{R(X, Y)Z] — 0. Covariant differentiation of the identity 
T (X, Y) = X x Y — VyX — [X, Y] = 0 with respect to Z yields 

0= V Z {V X L- V F X-[X, Y]} 

= V Z V Z L - V Z V F X - {V [X ,F]Z + [Z, [X, Y]]} 

where the torsion-free condition has been used again to derive the second equality. 
Symmetrizing this, we have 

0 = 6{V Z V X L - V Z V F X - V lx ,y,Z - [Z, [X, Y]]} 

= ©{V Z V X L - V Z V F X - V [X ,F]Z} = 6{R(X, Y)Z } 

where the Jacobi identity S{[X, [L, Z]]} = 0 has been used. 

The second Bianchi identity becomes &{(XxR)(Y. Z)}V — 0 where & 
symmetrizes (X, Y, Z) only. If the identity R(T(X . Y). Z) V = R(X x K — V y X — 
[X, Y\, Z) V — 0 is symmetrized, we have 

0 = 6{R(V X Y, Z ) - R(VyX, Z) - R([X, Y], Z)}V 
= 6{R(V Z X , Y) - R{X, V Z L) - R([X, T], Z)}V. (7.82) 

If we note the Leibnitz rule, 

V Z {R(X, Y)V} = (V Z R)(X, Y)V 

+ R(X, L)V Z V + R(V Z X, Y)V + R(X, V Z F)V 


(7.82) becomes 


0 = 6{— (V Z R)(X, Y) + [V Z , R(X, K)] - R([X, F], Z)}V. 



The last two terms vanish if R(X , Y)V — {[Vx, Vy] — V[x,y]}V is substituted 
into them, 

6{[V Z , R(X, T)]-R([X, Y],Z)}V 

= S{[Vz, [Vx, Vy]] - [Vz, V[x,yj] - [V[x,y], V z ] + V[[x,y],z]}V 
= 0 

where the Jacobi identities 6{[V Z , [Vx, Vy]]} = 6{[[X, Y], Z]} — 0 have been 
used. We finally obtain 6{(VxR)(F, Z)}V = 0. □ 

In components, the Bianchi identities are 

*Vv + *'W + *'W = o 

(the first Bianchi identity) (7.83a) 

(V*/?) 5 ^„ + (V„tf)* 

kVK + (VvJ?) f ^= 0 

(the second Bianchi identity). (7.83b) 

By contracting the indices § and /x of the second Bianchi identity, we obtain an 
important relation: 


(V K Ric)xv + (V^Av* - (V v Ric) XK = 0. (7.84) 

If the indices 1 and v are further contracted, we have V /x (7?.<5 — 2Ric) ll K = 0 or 

V M G /iu = 0 (7.85) 

where G 1 ' 1 ’ is the Einstein tensor defined by 

G flv = Ric ,lv - (7.86) 


Historically, when Einstein formulated general relativity, he first equated the Ricci 
tensor Ric^ v to the energy-momentum tensor T^ v . Later he realized that T 1 ' 1 ’ 
satisfies the covariant conservation equation V M T llv = 0 while Ric ,lv does not. 
To avoid this difficulty, he proposed that G ,lv should be equated to 7’ /u '. This 
new equation is natural in the sense that it can be derived from a scalar action by 
variation, see section 7.10. 

Exercise 7.12. Let (M, g) be a two-dimensional manifold with g = —dr <X> d; + 
R 2 (t)dx ® dx, where R(t) is an arbitrary function of t. Show that the Einstein 
tensor vanishes. 

The symmetry properties (7.80a)-(7.80c) restrict the number of independent 
components of the Riemann tensor. Let m be the dimension of a manifold ( M , g). 
The anti-symmetry R K >_ /lv — — R)_ KVI i implies that there are N = ('”) independent 
choices of the pair ( /x , r>). Similarly, from R KXf j, v = —Rxk^v, we find there are 



N independent pairs of (k, a). Since R k xixv is symmetric with respect to the 
interchange of the pairs (k, X) and (p, v), the number of independent choices of 
the pairs reduces from N 2 to = ^N(N + 1). The first Bianchi identity 

R/cX/xv + Rk^vX + RkvX[i = 0 (7.87) 


further reduces the number of independent components. The LHS of (7.87) is 
totally anti-symmetric with respect to the interchange of the indices (A., p, v). 
Furthermore, the anti-symmetry (7.80b) ensures that it is totally anti- symmetric 
in all the indices. If m < 4, (7.87) is trivially satisfied and it imposes no additional 
restrictions. If m > 4, (7.87) yields non-trivial constraints only when all the 
indices are different. The number of constraints is equal to the number of possible 
ways of choosing four different indices out of m indices, namely Noting 
that (™) = m(m — 1 ){m — 2)(m — 3)/4! vanishes for m < 4, the number of 
independent components of the Riemann tensor is given by 


F(m) = 





= — m 2 (m 2 — 1). 
12 


(7.88) 


F(l) = 0 implies that one-dimensional manifolds are flat. Since F(2) = 1, there 
is only one independent component R 1212 on a two-dimensional manifold, other 
components being either 0 or ±R 1212- F( 4) = 20 is a well-known fact in general 
relativity. 


Exercise 7.13. Let ( M,g ) be a two-dimensional manifold. Show that the 
Riemann tensor is written as 


R/cX/iv — K (§k n§xv SkvSxii) (7.89) 

where K e 'J(M). Compute the Ricci tensor to show Ric lnl oc g llv . Compute the 
scalar curvature to show K — 72/2. 


7.5 Holonomy 

Let (M, g) be an m -dimensional Riemannian manifold with an affine connection 
V. The connection naturally defines a transformation group at each tangent space 
T p M as follows. 

Definition 7.3. Let p be a point in (M, g) and consider the set of closed loops at 
p, {c(?)|0 < t < 1, c(0) = c(l) = p}. Take a vector X e T p M and parallel 
transport X along a curve c(t). After a trip along c(t), we end up with a new 
vector X c e T p M. Thus, the loop c(f) and the connection V induce a linear 
transformation 

P c : T p M -> T p M. (7.90) 

The set of these transformations is denoted by FI (p) and is called the holonomy 
group at p. 



We assume that H ( p) acts on T p M from the right, P C X — Xh (h e 
H(p)). In components, this becomes P C X — X^h^ev, {e v } being the basis 
of T p M. It is easy to see that H ( p ) is a group. The product P C 'P C corresponds to 
parallel transport along c first and then c' . If we write P,\ = P C 'P C . the loop d is 
given by 


d{t) = 


| c(2t) 
|c'(2t — 1) 






t < 1. 


(7.91) 


The unit element corresponds to the constant map c p (t) — p (0 < t < 1) and 
the inverse of P c is given by P c -i, where c^ 1 (t) — c(l — t). Note that H(p ) is 
a subgroup of GL (m, E), which is the maximal holonomy group possible. H(p) 
is trivial if and only if the Riemann tensor vanishes. In particular, if ( M , g) is 
parallelizable (see example 7.2), we can make H(p) trivial. 

If M is (arcwise-)connected, any two points p, q e M are connected by a 
curve a. The curve a defines a map x a : T p M —> T q M by parallel transporting a 
vector in T p M to T q M along a. Then the holonomy groups //(/?) and H(q ) are 
related by 

// (?) = r~ l H(p)r a (7.92) 


hence H (q) is isomorphic to II ( p). 

In general, the holonomy group is a subgroup of GL(m, 1R). If V is a metric 
connection, V preserves the length of a vector, g p (P c (X), P C (X)) = g p (X, X) 
for X e T p M. Then the holonomy group must be a subgroup of SO (m) if (M, g) 
is orientable and Riemannian and SO (m — 1, 1 ) if it is orientable and Lorentzian. 


Example 7. 7. We work out the holonomy group of the Levi-Civita connection on 
S 2 with the metric g — dd ® A6 + sin 2 dip ® d</>. The non- vanishing connection 
coefficients are T 0 ^ = — sinOcosDand — T^g,p =cotd. For simplicity, 
we take a vector eg = 3/3 9 at a point (6o, 0) and parallel transport it along a 

circle 9 — 9q,Q < (p <2n . Let X be the vector eg parallel transported along the 

circle. The vector X = X 8 eg + X^e^ satisfies 

d^X 0 — sin 0o cos = 0 (7.93a) 

d^X^ + cot9oX e = 0. (7.93b) 


Equations (7.93a) and (7.93b) represent the harmonic oscillations. Indeed if we 
take a (/(-derivative of (7.93a) and use (7.93b), we have 


c ]-yd fivi) a2 v8 

—j- - sin 0 O cos 9 0 ^j- = —j- - cos 2 9 0 X e = 0. (7.94) 

d(j) z dcp d(j) z 

The general solution is X 6 = Acos(Co</>) + B sin(Co<p), where Co = cos0o- 
Since X 1 ' = I at </; = 0 we have 


X 6 — cos(Co</>) 


X* = - 


sin(C 0 </>) 


sin 0o 



After parallel transport along the circle, we end up with 


X(cp — 2i r) = cos(27rCo)e0 


sin(27rCo</>) 

• a e< t 1 ' 

sin 


(7.95) 


Now the vector is rotated by © = 27r cos Oq, with its magnitude kept fixed. If we 
take a point p e S 2 and a circle in S 2 which passes through p, we can always find 
a coordinate system such that the circle is given by 0 — do (0 < 6 < n) and we 
can apply our previous calculation. The rotation angle is —2n <® <2 : r and we 
find that the holonomy group at p e S 2 is SO(2). 


In general, S m (in > 2) admits the holonomy group SO (hi). Product 
manifolds admit more restricted holonomy groups. The following example is 
taken from Horowitz (1986). Consider six-dimensional manifolds made of the 
spheres with standard metrics. Examples are S 6 , S 2 x S 2 , S 2 x S 2 x S 2 , T 6 = 
■S 1 x • • • x ,S' 1 . Their holonomy groups are: 

(i) S 6 :H(p) = SO(6). 

(ii) S 2 x S 2 : H(p ) = SO(3) x SO(3). 

(iii) S 2 x S 2 x S’ 2 : Hip) = SO(2) x SO(2) x SO(2). 

(iv) T 6 : H(p) is trivial since the Riemann tensor vanishes. 


Exercise 7.14. Show that the holonomy group of the Levi-Civita connection of 
the Poincare metric given in example 7.6 is SO(2). 


7.6 Isometries and conformal transformations 
7.6.1 Isometries 

Definition 7.4. Let ( M , g) be a (pseudo-)Riemannian manifold. A diffeomor- 
phism f : M -¥ M is an isometry if it preserves the metric 

f*8f(p) — 8 P (7.96a) 

that is, if g f(p) (f*X, f*Y) = gp (X, Y) for X, Y e T p M. 

In components, the condition (7.96a) becomes 

= (7.96b) 

where x and y are the coordinates of p and / (p), respectively. The identity map, 
the composition of the isometries and the inverse of an isometry are isometries; all 
these isometries form a group. Since an isometry preserves the length of a vector, 
in particular that of an infinitesimal displacement vector, it may be regarded as a 
rigid motion. For example, in W , the Euclidean group E", that is the set of maps 
f : x i->- Ax + T (A e SO («), T e IK" ), is the isometry group. 



7.6.2 Conformal transformations 


Definition 7.5. Let ( M , g) be a (pseudo-)Riemannian manifold. A diffeomor- 
phism / : M — »• M is called a conformal transformation if it preserves the 
metric up to a scale, 

f*gf{ P ) = e 2a 8 P or e S-(M) (7.97a) 

namely, gf< p) (f*X, /*K) = e 2a g p (X, Y ) for X,Y e T p M. 

In components, the condition (7.97a) becomes 

dy 01 dyP o r \ 

gapifip)) = ^g„ v (p). (7.97b) 

The set of conformal transformations on M is a group, the conformal group 
denoted by Conf(M). Let us define the angle 6 between two vectors X — X^dn, 
Y = Y ,x d lx e T P M by 


gp(x, Y) = g^yXvjr 
y/g P {X, X)g p (Y,Y) y/g^XiXig^Y^' 


(7.98) 


If / is a conformal transformation, the angle 6' between /*X and /*F is given by 


cos(L 


e 2 a g pv X^Y v 

Je 2 ° gSpXS X’l .e^g Kk Y K Y^ 


= cos 6 


hence / preserves the angle. In other words, f changes the scale but not the 
shape. 

A concept related to conformal transformations is Weyl rescaling. Let g and 
g be metrics on a manifold M. g is said to be conformally related to g if 

g p = e 2a(p) g p . (7.99) 


Clearly this is an equivalence relation among the set of metrics on M. The 
equivalence class is called the conformal structure. The transformation g -> 
e 2a g is called a Weyl rescaling. The set of Weyl rescalings on M is a group 
denoted by Weyl(M). 


Example 7.8. Let w = f(z) be a holomorphic function defined on the complex 
plane C [A C°°-function regarded as a function of z = x + iy and z = x — iv is 
holomorphic if 3 if(z,z) = 0.] We write the real part and the imaginary part of the 
respective variables as z — x + iy and w = u + iv. The map / : (x, y) i->- (u, v) 
is conformal since 


dir + dtr 


/ du 


3 u 


— d.v + — dy + 


\3.r 


dy 


dv dv 

—dx + — dy 

dx dy 


/ du 
\ dx 


(dx 2 + dy 2 ) 


(7.100) 



where use has been made of the Cauchy-Riemann relations 


3m dl) dll dl> 

dx dy dy dx 

Exercise 7.15. Let / : M — > M be a conformal transformation on a Lorentz 
manifold (M, g ). Show that /* : T p M -» T f( p )M preserves the local light cone 
structure, namely 


/* : 


timelike vector timelike vector 

null vector null vector 

spacelike vector spacelike vector. 


(7.101) 


Let g be a metric on M, which is conformally related to g as g — e 2olp> g. 
Let us compute the Riemann tensor of g. We could simply substitute g into 
the defining equation (7.42). However, we follow the elegant coordinate-free 
derivation of Nomizu (1981). Let K be the difference of the covariant derivatives 
V with respect to g and V with respect to g, 


K(x, Y) = v x r-v x r. 


(7.102) 


Proposition 7.1. Let U be a vector field which corresponds to the one-form do: 
Z[o] — (der, Z) = g(U, Z). Then 

K(X, Y) = X[o]Y + Y[o]X - g(X, Y)U. (7.103) 


Proof. It follows from the torsion-free condition that K (X. Y) = K ( Y, X). Since 
X x g — Vxg = 0, we have 

X[g(Y. Z)] = V Z [|(F, Z)] = g(V x , Z) + g(Y , V X Z) 


and also 

X[g(Y, Z)i = V x [e 2cr g(Y, Z)] 

= 2 X[o]e 2cr g(Y, Z) + e 2or [g(V x , Z) + g(Y , V X Z)]. 

Taking the difference between these two expressions, we have 

g(K(X, Y), Z) + g(Y, K(X, Z)) = 2X[o]g(Y, Z). (7.104a) 

Permutations of (X . Y, Z) yield 

g(K(Y, X ), Z) + g(X, K{Y, Z)) = 2 Y[o]g{X. Z ) (7.104b) 

g(K{Z. X), Y) + g(X, K (Z, Y)) = 2 Z[o]g(X, Y). (7.104c) 

The combination (7.104a) + (7.104b) — (7.104c) yields 

g(K(X, Y), Z ) = X[o]g(Y, Z) + Y[o]g(X. Z ) - Z[o]g(X, Y). (7.105) 



The last term is modified as 


Z[ir]g(X, Y ) = g(U, Z)g(X , Y) = g{g(Y, X)U, Z). 

Substituting this into (7.105), we find 

g(K{X, Y) - X[o]Y - Y[a]X + g(X, Y)U, Z) = 0. 

Since this is true for any Z, we have (7.103). □ 

The component expression for K is 

e V ) = — (f HV r 

= e^[a]e v + e v [a]e^ - g{e^, e v )g KX d K ae\ 

from which it is readily seen that 

r \ v = r\y + S\d„(T + s\d v a - g„ v g Kk d K a. (7.106) 

To find the Riemann curvature tensor, we start from the definition, 

R(X , T)Z = V z VyZ - Vy V X Z - V [X , Y ]Z 

= V z [VyZ + K{Y, Z)\ - Vy[VxZ + K(X, Z)] 
~{V [XJ] Z+K([X, Y],Z)} 

= VzfVyZ + K(Y, Z)i + K(X, VyZ + K(Y, Z)) 

- Vy{V x Z + K(X, Z)j - K(Y , V X Z + K(X , Z)) 
-{V [x ,y]Z+^([Z,T],Z)}. (7.107) 

After a straightforward but tedious calculation, we find that 

R(X, Y)Z = R(X, Y)Z + (V z da, Z)Y - (Vy do-, Z)X 

- g(Y,Z)X x U + Y[o]Z[o]X 

- g(Y, Z)U[a]X + X[a]g(Y, Z)U 
+ g(X, Z)V Y U — X[a]Z[a]Y 

+ *(X, Z)(7[ct]T - Y[a]g(X, Z)U . (7.108) 

Let us define a type (1,1) tensor field B by 

BX = -X[a]U + X x U +{U[ g]X. (7.109) 

Since g(Vy(7, Z) = (Vyda, Z) (exercise 7.8(c)), (7.108) becomes 

R(X, T)Z = R(X, Y)Z - [g(F, Z)BI - g(BX , Z)T 
+ g(RF,Z)X-g(X, Z)BT], 


(7.110) 



In components, this becomes 


R Xijlv — R gvxBn + g^xB^ 8 v g^ x Bj 8 ^ + g^xB v 

(7.111) 


where the components of the tensor B are 


V = - 3 vl°U k + (Vuljy + it/[a]V 

= - d^og^dxo + g^id^dxa - T^rr) + ^g X hx<T 

(7.112) 

Note that B /lv = g vX B ^ = B v/l . 

By contracting the indices in (7.1 1 1), we obtain 


Ric^v — Ric/^v g/ivBx (wi 2)2? v ^, 

(7.113) 

e 2cr 77 = 77 - 2 (m - \)B k k 

where m = dim M. Equation (7.1 14a) is also written as 

(7.114a) 

guv 77 = [77 - 2 (m - 1 )B x k ]g flv . 

(7.114b) 


If we eliminate g^ llt /?;/' and B /n in R K ; M v in favour of Ric and 77 and separate 
barred and unbarred terms, we find a combination which is independent of a , 

1 

CkXiiv — RkX^iv ~t~ -(RiCK/igXv Ricx/igicv T RlcXvgxfi Ric K vgXii) 
in — I 

77 

+ ^ 2)(m jy igK/xgXv ~ guvgxii) (7.115) 

where m > 4 (see problem 7.2 for m = 3). The tensor C is called the Weyl 
tensor. The reader should verify that C KXflv = C K -,_ jn ,. 

If every point p of a (pseudo-)Riemannian manifold (M, g) has a chart 
( U,cp ) containing p such that g^ v = e 2a 8^ v , then ( M,g ) is said to be 
conformally flat. Since the Weyl tensor vanishes for a flat metric, it also vanishes 
for a conformally flat metric. If dimM > 4, then C = 0 is the necessary and 
sufficient condition for conformal flatness (Weyl-Schouten). If dimM = 3, the 
Weyl tensor vanishes identically; see problem 7.2. If dimM = 2, M is always 
conformally flat; see the next example. 

Example 7.9. Any two-dimensional Riemannian manifold (M, g) is conformally 
flat. Let (x, y) be the original local coordinates with which the metric takes the 
form 


dj 2 = g xx d.r 2 + 2 g xy dx dy + g yy dy 2 . 


(7.116) 


Let g = g xx gyy - g 2 and write (7.1 16) 


ds 2 = ( yfg^ dx + 


gxy + i V g ( 

\/ gxx 


d y Vgyy dx + 


gxy - jyg 
\fgxx 


d v 


According to the theory of differential equations, there exists an integrating factor 
A(.r, y ) = Xi(x, y ) + iA 2 (x, >’) such that 

A. ( Vgjcx dx + v ~|L =— dy ) = dn + i dt> (7.117a) 

V Vto ) 

A L/g^dx + ~ 1V ^ dy ) = dn - idu. (7.117b) 

V yfgxx ) 


Then d.? 2 = (dii 2 + dir)/|A| 2 and by setting |A| -2 = e 2cr , we have the desired 
coordinate system. The coordinates (u, v ) are called the isothermal coordinates. 
[Remark: If the curve u = a constant is regarded as an isothermal curve, v — a 
constant corresponds to the line of heat flow.] 

For example, let ds 2 = d9 2 + sin 2 6 d <p 2 be the standard metric of S 2 . Noting 


that 




1 

sind 


we find that / : (6, <fi) i-> (u, v ) defined by u = log | tan ^9 | and v — cp yields a 
conformally flat metric. In fact, 


dx 2 


sin 2 0 



= sin 2 9(du 2 + du 2 ). 


If ( M,g ) is a Lorentz manifold, we have integrating factors a (x , y ) and 
/i (x , y) such that 

A (V#** dx + — + dy^) =du+dv 

V Vfc / 

M (Vg^dx + — dy^) = du - dv 

V Vfc / 

In terms of the coordinates (n, v ) the metric takes the form d.v 2 = A -1 n~ l (dn 2 — 
dir). The product A/i is either positive definite or negative definite and we may 
set l/|A/x| = e 2cr to obtain the form 

du 2 = ±e 2cr (d u 2 - dir). (7.119) 


(7.118a) 

(7.118b) 


Exercise 7.16. Let ( M,g ) be a two-dimensional Lorentz manifold with g — 
—dt (g) dt + t 2 dx <g> dx (the Milne universe). Use the transformation \t\ hr e' 1 to 
show that g is conformally flat. In fact, it is further simplified by (rj. x ) (u — 
e' 1 si nh x, v = e' 1 cosh x). What is the resulting metric? 



7.7 Killing vector fields and conformal Killing vector fields 
7.7.1 Killing vector fields 


Let ( M , g) be a Riemannian manifold and X e X(M). If a displacement eX, s 
being infinitesimal, generates an isometry, the vector field X is called a Killing 
vector field. The coordinates x 1 ' of a point p e M change to x /l + eX 1 ' ( p) under 
this displacement, see (5.42). If / : x^ i->- x ^ + sX ^ is an isometry, it satisfies 
(7.96b), 


d(x K +sX K ) d(x x + eX x ) 


gtcxix + sX) = gi_ LV (x). 


dx t 1 dx v 

After a simple calculation, we find that g llv and X 1 ' satisfy the Killing equation 


xh^ gfJLV + + d v X l gflX = 0. (7.120a) 


From the definition of the Lie derivative, this is written in a compact form as 


(£xg)w=0. (7.120b) 

Let (p t : M -* M be a one-parameter group of transformations which generates 
the Killing vector field X. Equation (7.120b) then shows that the local geometry 
does not change as we move along ([>,. In this sense, the Killing vector fields 
represent the direction of the symmetry of a manifold. 

A set of Killing vector fields are defined to be dependent if one of them 
is expressed as a linear combination of others with constant coefficients. Thus, 
there may be more Killing vector fields than the dimension of the manifold. [The 
number of independent symmetries has no direct connection with dimM. The 
maximum number, however, has; see example 7.10.] 


Exercise 7.17. Let V be the Levi-Civita connection. Show that the Killing 
equation is written as 

(V /t Z) y + (V„XV = d„X v + d v X„ - 2r\ v X x = 0. (7.121) 

Exercise 7.18. Find three Killing vector fields of (M 2 , 5) . Show that two of 
them correspond to translations while the third corresponds to a rotation; cf next 
example. 


Example 7.10. Let us work out the Killing vector fields of the Minkowski 
spacetime (IR 4 , rj), for which all the Levi-Civita connection coefficients vanish. 
The Killing equation becomes 


d„X v + d v X^ = 0. 


(7.122) 


It is easy to see that X is, at most, of the first order in x. The constant solutions 



correspond to spacetime translations. Next, let X v , « //i; being constant. 

Equation (7. 122) implies that a llv is anti-symmetric with respect to p. -*> v. Since 
(2) = 6, there are six independent solutions of this form, three of which 

X (j) o - 0 X (J)m = Sj mn x n (1 < j, 77i, n < 3) (7.123b) 

correspond to spatial rotations about the x 7 -axis, while the others 

(A)o = x k X (k)m = -S km x° (1 < k, m < 3) (7.123c) 

correspond to Lorentz boosts along the x^’-axis. 

In ??7 -dimensional Minkowski spacetime (m > 2), there are 777(777 + l)/2 
Killing vector fields, m of which generate translations, (m — 1), boosts and 
(777 — l)(m — 2)/2, space rotations. Those spaces (or spacetimes) which admit 
777 (777 + 1 )/2 Killing vector fields are called maximally symmetric spaces. 

Let X and Y be two Killing vector fields. We easily verify that 

(i) a linear combination a X + bY (a. b e IK) is a Killing vector field; and 

(ii) the Lie bracket [X . V] is a Killing vector field. 

(i) is obvious from the linearity of the covariant derivative. To prove (ii), we use 
(5.58). We have C[ X ,Y]g = C x C, Y g - C Y C x g — 0, since C x g = Cyg = 0. 
Thus, all the Killing vector fields form a Lie algebra of the symmetric operations 
on the manifold M; see the next example. 

Example 7.11. Let g = dO <S> dd + sin 2 0 d(p ® dcp be the standard metric of S 2 . 
The Killing equations (7.121) are: 

d e X e + deX e =0 (7.124a) 

d^Xfj, + d^X,/) + 2 sin0 cosOXg = 0 (7.124b) 

dgX^ + d^Xg -2cot(9X 0 = 0. (7.124c) 

It follows from (7.124a) that Xg is independent of 6: Xg(Q,<p) — /(</>). 
Substituting this into (7.124b), we have 


Xfj, — — F(cp) sin0 cos# + g{6) (7.125) 

where F(<p) — J ^ /(</>) d cp. Substitution of (7.125) into (7.124c) yields 
, , d? df 

— cos“ 9 — shr 9) H 1 — 1 — f 2 cot 9{F(<p) sin0 cos# — g(9)) = 0. 

d 9 d d> 


This equation may be separated into 

dg 


df 


A6 


- 2cot 9g(9) = - - F{<p). 



Since both sides must be separately constant (=C), we have 


A6 


— 2 cot 9g(0) — C 


d / 


+ F(4>) = -c. 


(7.126a) 

(7.126b) 


Equation (7.126a) is solved if we multiply both sides by exp(— f d 9 2cot$) = 
sin 2 9 to make the LHS a total derivative, 

A ( zAA _ c 

A9 \ sin 2 0 ) sin 2 9~ 


The solution is easily found to be 

g(9) = (Ci - C cot 9) sin 2 9. 

Differentiating (7.126b) again, we find that / is harmonic, 

Xo(cp) — /((/>) = A sirup + B cos (p 
F(<p)= — A cos (p + B sin0 — C. 

Substituting these results into (7.125), we have 

X l / > (9, <p) — — (— A cos (f> + B sin0 — C) sind cos 9 + (Ci — C cot 9) sin 2 9 
= (A cos</> — B sin</>) sin0cos0 + Ci sin 2 0. 

A general Killing vector is given by 


a 3 J. 3 

X = X e b X^ — 

39 3<p 

(3 3 

— A sin (p H cos (f> cot 9 — 

\ 39 3(p 

(3 3 \ 3 

+ s ^o^-- s ,n 0cote-j + c,-. 


The basis vectors 


3 3 

L x = — cos (p b cot 9 sin (p — 

39 3<p 

3 3 

L y = sin (p b cot 9 cos (p — 

39 3 (p 


L z = — 


3 

dip 


(7.127) 


(7.128a) 

(7.128b) 

(7.128c) 


generate rotations round the x, y and z axes respectively. 



These vectors generate the Lie algebra so(3). This reflects the fact that S 2 
is the homogeneous space SO(3)/SO(2) and the metric on S 2 retains this SO(3) 
symmetry (see example 5.18(a)). In general 5” = SO (n + 1 )/SO(«) with the 
usual metric has dim SO(n + 1) = n (n + l)/2 Killing vectors and they form 
the Lie algebra so (n + 1). The sphere S" with the usual metric is a maximally 
symmetric space. We may squash S" so that it has fewer symmetries. For 
example, if S 2 considered here is squashed along the 7-axis it has a rotational 
symmetry around the "-axis only and there exists one Killing vector field L- — 
d/dtp. 


7.7.2 Conformal Killing vector fields 

Let (M, g ) be a Riemannian manifold and let X e X(M). If an infinitesimal 
displacement given by eX generates a conformal transformation, the vector field 
X is called a conformal Killing vector field (CKV). Under the displacement 
x ^ — »■ x 11 + sX 11 , this condition is written as 


d(x K + sX K ) d(x l + sX l ) 2(T 

-g K x(x + eX ) = e- gftvix). 


dx^ 


dx l 


Noting that a oc e, we set a — eip/2, where i fr e T(M). Then we find that g flv 
and X 11 satisfy 


Cxgw = xh^ gflv + 3 ^X K g KV + 3 v X x gfll = $ gi 


I1V- 


(7.129a) 


Equation (7.129a) is easily solved for t// to yield 

X^g^hguv + 23„X' i 

f = — ifLLZ — H (7.129b) 

m 

where m = dim M. We verify that 

(i) a linear combination of CKVs is a CKV: (C a x+bY g) nv — (a<P + b\p)g^ v 
where a, b e ffi, Cxg^v = <Pgiiv and Cyg^v = fg^v', 

(ii) the Lie bracket [X, Y ] of a CKV is again a CKV: C\x.Y]giiv = (X[\[r] — 
Y[<p])gn V . 


Example 7.12. Let x 11 be the coordinates of (ffi m , <5). The vector 

D = x^-^— (7.130) 

OX P 

(dilatation vector) is a CKV. In fact, 


C-D&nv = d ll x K 8 KV + d v x x 8^x = 2 8^ v . 



7.8 Non-coordinate bases 


7.8.1 Definitions 

In the coordinate basis, T p M is spanned by {e p } = {d/dx 11 } and T*M by {dx ll \. 
If, moreover, M is endowed with a metric g, there may be an alternative choice. 
Let us consider the linear combination, 

e a = e^— {<?«/} e GL(m, ffi) (7.131) 

dx^ 

where dete a ^ > 0. In other words, {e a } is the frame of basis vectors which is 
obtained by a GL(m, HR) -rotation of the basis {e p } preserving the orientation. We 
require that {e a } be orthonormal with respect to g, 

g(e a , ep) = e^ep v gf_ lv = 8 a p. (7.132a) 

If the manifold is Lorentzian, 8 a p should be replaced by r) a p. We easily reverse 
(7.132a), 

guv — e a P e^ v 8 a p (7.132b) 

where e a p is the inverse of e a ^\ e a p e a v = 8 p v , e a p^ = 8 a p. [We have used 
the same symbols for a matrix and its inverse. So long as the indices are written 
explicitly it does not cause confusion.] Since a vector V is independent of the 
basis chosen, we have V = = V a e a = 'V a e a ^e p . It follows that 

V 11 = W V“ = (7.133) 

Let us introduce the dual basis {6 01 } defined by (9 a , ep ) = 8 a p. 9 a is given 
by 

9 a =e a p dx^. (7.134) 

In terms of {9 01 }, the metric is 

g = g^ dx^ ® dx v = 8 a p9 a ® §P. (7.135) 

The bases {e a } and {()“ ) are called the non-coordinate bases. We use k, X, 
/i, v, . . . (a, ft. y. 8, . . .) to denote the coordinate (non-coordinate) basis. The 
coefficients e a 11 are called the vierbeins if the space is four dimensional and 
vielbeins if it is many dimensional. The non-coordinate basis has a non-vanishing 
Lie bracket. If the {e a } are given by (7.131 ), they satisfy 

[e a , epi\ P — c a p Y (p)e y \ p (7.136a) 

where 

Cap 7 (p) = e y v [e a li df l ep v - ep^e^Kp). (7.136b) 



(7.137) 


Example 7.13. The standard metric on S 2 is 

g = d0 <8> d6 + sin 2 6 dcp <g> dcp — Q 1 <g> 0 1 + 0 2 <g> 0 2 
where 9 l = d8 and 8 2 — sin0 dcp. The ‘zweibeins’ are 

e 1 e = 1 e l <j, = 0 
e 2 @ = 0 e 2 ^ = sin0. 

The non- vanishing components of c a p Y are C 12 2 = — C 21 2 = — cot0. 

Exercise 7.19. (a) Verify the identities, 

=SP ft e a »ep\ 

(b) Let y a be the Dirac matrices in Minkowski spacetime, which satisfy 
{y a , yP} — 2ri a P . Define the curved spacetime counterparts of the Dirac matrices 
by = e^y 01 . Show that 

{y^, y v } = 2g^ v . (7.140) 

7.8.2 Cartan’s structure equations 

In section 7.3 the curvature tensor R and the torsion tensor T have been defined 
by 


(7.138) 


(7.139) 


R(X, Y)Z = V x V y Z - V F V z Z - V [X ,F]Z 
T(X. Y) = V X T - V F Z - [X, Y], 

Let {e a } be the non-coordinate basis and {0“} the dual basis. The vector fields 
{e a } satisfy [ e a , ep\ = c a p Y e y . Define the connection coefficients with respect to 
the basis {e a } by 

V a ep = = r Y a pe y . (7.141) 

Let e a = e a ^e^. Then (7.141) becomes e a >*(d ll ep v + ep k r v ^)e v = Y y a pe y v e v , 
from which we find that 

r Y a p = e Y v e a ^{d ll ep v + ep X T v = e y v eJ l V l _ l ep v . (7.142) 

The components of T and R in this basis are given by 

T a Py = <0°, T(ep, e Y )) = <0°, Vpe y - V y ep - [ep, e Y ]) 

= T °‘p y -T a y p-cp y a . (7.143) 

R a PyS = (0 a , Vy^sep - v s v y ep - V^ h] ep) 

= (0 a , Vy(r e S pe £ ) - V*T £ Y pe £ ) - c yS E V £ ep) 

= e Y [r a S p] - e s [r %p] + r £ ^rv - - CyS E r V 

(7.144) 



We define a matrix-valued one-form { of p ) called the connection one-form by 


ofp = r a y p§ 7 . (7.145) 

Theorem 7.3. The connection one-form of p satisfies Cartan’s structure 
equations, 

d<9" + ofp A 9 P = T a (7. 146a) 

d ofp + of Y A a/p = R a p (7.146b) 

where we have introduced the torsion two-form T a = jT a p y 6 p A 6 7 and the 
curvature two-form R a p = \ R" p Y sO y A 9 s . 

Proof. Let the LHS of (7.146a) act on the basis vectors e y and es, 

d e a (e Y , e s ) + [{afp, e Y ){9 p ,e s ) - {9 P ,e Y ){a> a p, eg)] 

= {e Y Ud a , h)] ~ es[(e\ e Y )} - (0 a , [e Y ,e s ])} + {(. »“a , e Y ) - [co a Y , e s )} 

„ a , r'Q' r'Qi 

— Cy8 i 1 y, 5 1 8y — y8 

where use has been made of (5.70). The RHS acting on e y and e§ yields 

\T a p B [{9 p , e y ){6 e , es) - {9 e , e Y ){e p ,es)] = T a v s 

which verifies (7.146a). 

Equation (7. 146b) may be proved similarly (exercise). □ 

Taking the exterior derivative of (7. 146a) and (7. 146b), we have the Bianchi 
identities 

dr“ + off) A T? = R a p A (7. 147a) 

dR 01 p + of Y A R 7 p — R a y A of p = 0. (7.147b) 

These are the non-coordinate basis versions of (7.81a) and (7.81b). 

7.8.3 The local frame 

In an w-dimensional Riemannian manifold, the metric tensor g ll v has m(m+ l)/2 
degrees of freedom while the vielbein ef has m 2 degrees of freedom. There are 
many non-coordinate bases which yield the same metric, g, each of which is 
related to the other by the local orthogonal rotation, 

9 a — > e' a (p ) = A a p(p)9 p (p) (7.148) 

at each point p. The vielbein transforms as 

e%(p) — > e'%(p) = A a p(p)ef , lx (p). (7.149) 



Unlike k,X, (i,v, . . . which transform under coordinate changes, the indices 
a, /S, y, . . . transform under the local orthogonal rotation and are inert under 
coordinate changes. Since the metric tensor is invariant under the rotation, 
satisfies 

A“ pS a sA s y — 8p y if M is Riemannian (7.150a) 

A“ prjasA 8 y — rip-y if M is Lorentzian. (7.150b) 

This implies that { A"^ (/?)} e SO (m) if M is Riemannian with dimM = m and 
{A"^!/?)} e SOO« — 1 , 1 ) if M is Lorentzian. The dimension of these Lie groups 
is 777(777 — l )/2 = 777 2 — 777(777 + l)/ 2 , that is the difference between the degrees 
of freedom of e a ^ and g ^ v . Under the local frame rotation A 01 p{p), the indices 
a, P, y, 8, . . . are rotated while 7 t, X, fi, v, . . . (world indices) are not affected. 
Under the rotation ( 7 . 148 ), the basis vector transforms as 

««— ►e«=^(A" 1 ) /, „. (7.151) 

Let t — t ll v e lx ® d.r y be a tensor field of type (1, 1). In the bases {e a } 
and {0“}, we have t = t 01 pe a ® 9^, where t a p — e a lx ep v t tl v . If the new 
frames {e f a } = {^(A -1 )^^} and {O' 01 } = {A a p§P} are employed, the tensor t 
is expressed as 

t = t' a pe' a ® O'P = t ,a pe y (A l ) Y a ® A^g9 S 
from which we find the transformation rule, 

t a . Ja A O' t y t A —1,4 

t p > t p = A y t r ) p. 

To summarize, the upper (lower) non-coordinate indices are rotated by A (A -1 ). 
The change from the coordinate basis to the non-coordinate basis is carried out 
by multiplications of vielbeins. 

From these facts we find the transformation rule of the connection one-form 
a) a p. The torsion two-form transforms as 

T a — y T' a = d 9 ,a + co' a p A 9 ,ft = A a p[d9 p + w p y A 9 Y ], 

Substituting B' a — A 01 p9^ into this equation, we find that 

co ,a pAP Y =A a s CD S y-dA a y. 

Multiplying both sides by A -1 from the right, we have 

©'"/» = A a yCO Y s (A- l ) S p + A 01 y (dA~ l ) y p (7.152) 

where use has been made of the identity dA A -1 + A dA -1 = 0, which is derived 
from AA _1 = /,„. 

The curvature two-form transforms homogeneously as 

R a p -a R' a p = A a y R y $(A~ 1 ) S p (7.153) 

under a local frame rotation A. 



7.8.4 The Levi-Civita connection in a non-coordinate basis 


Let V be a Levi-Civita connection on (M, g), which is characterized by the metric 
compatibility Vxg = 0, and the vanishing torsion F^,, — r\ 7/ = 0. It is 
interesting to see how these conditions are expressed in the present approach. 
The components r' /lv and r a p y are related to each other by (7.142). Let (M, g) 
be a Riemannian manifold (if (M, g) is Lorentzian, we simply replace S aj g all 
below by rj al g). If we define the Ricci rotation coefficient r a p y by 8 a sr s p y the 
metric compatibility is expressed as 

r a py = 8ase S xe^ l 'V fl e y k — -Sase^ep^V^e 8 \ 

= - Syse^e^V^ = -r YPa (7.154) 

where V^g = 0 has been used. In terms of the connection one-form u> a p = 
8 ay u> y p, this becomes 

u>oip — —u>p a . (7.155) 

The torsion-free condition is 

d0 a + oo a p /\ ¥ = 0. (7.156) 

The reader should verify that (7.156) implies the symmetry of the connection 
coefficient 1'' ^,., = r\ 7 , in the coordinate basis. The condition (7.156) enables 
us to compute the c a p y of the basis { e u ) . Let us look at the commutation relation 

c a p r e y — [e a , ep] — V a ep - Vpe a (7.157) 

where the final equality follows from the torsion-free condition. From (7.141), 
we find that 

c aP Y = rr ap -ry pa . (7.158) 

Substituting (7.158) into (7.144) we may express the Riemaun curvature tensor in 
terms of T only, 

R a p y s = e Y [r a S p] - e s [r a y p] + r £ S pr a ys - r £ K/j r“ fc 

-(r e yS -r e Sy )r a e p. (7.159) 

Example 7.14. Let us take the sphere S 2 of example 7.13. The components of 
e a fi are 

e l e = 1 e l </) = 0 e 2 g = 0 e 2 ^ = sin0. (7.160) 

We first note that the metric condition implies co\\ — u > 22 — 0, hence o ) 1 1 = 
co 2 2 — 0. Other connection one-forms are obtained from the torsion-free 
conditions, 

d(d0) + <w 1 2 A (sin 6 dip) = 0 
d(sin 9 dip) + co 2 1 A dd = 0. 


(7.161a) 

(7.161b) 



From the second equation of (7.161), we easily see that co 2 \ = cos Odcp and the 
metric condition con — —con implies to 1 2 = — cos Odcp. The Riemann tensor is 
also found from Cartan’s structure equation. 


co l 2 Aft. 2 i = \R l i a p9 a A§ p (7.162a) 

da) 1 2 =\R X 2 a ftO a (7.162b) 

d co 2 i = \R 2 \ a pO a A §P (7.162c) 

co 2 ! aco 1 2 = jR 2 2a pd a (7.162d) 


The non-vanishing components are R 1 212 = — /f 1 221 = sin 0, R 2 \n — 
— R 2 n\ — — sin 6. The transition to the coordinate basis expression is carried 
ont with the help of c„ /; and e" /; . For example, 


R 9 rj>e<p = e a 9 e p ( pe y 0e s ^R c, pys — — R X 2 n = t — -• 

sin 2 6 sin 6 

Example 7.15. The Schwarzschild metric is given by 

d.v 2 = — ^1 — — — ^ dr 2 H ~2M ^ r 2 ^ 2 + sin 2 ^ d</> 2 ) 

r 

= - 0° <g> 0° + 9 1 ® (9 1 + (9 2 <g> <9 2 + <9 3 <g> 0 3 (7.163) 


where 


0 ° 



0 2 = r d 0 0 3 = r sin 0 dcp. 


(7.164) 


The parameters run over the range 0 < 2 M < r, 0 < 9 < 7t and 0 < <p < 2ic. 
The metric condition yields <w°o — co 1 ! — or 2 — co 3 3 = 0 and the torsion-free 
conditions are: 


d[(l - 2M/r) 1 / 2 dr] + co°p A 9 P = 0 

(7.165a) 

d[(l - 2M/r)~ 1/2 dr] + co 1 p A 0? = 0 

(7.165b) 

d(r dO) + co 2 p A 6 P = 0 

(7.165c) 

d(r sin 6 dcp) + co 3 p A 6^ — 0. 

(7.165d) 


The non-vanishing components of the connection one-forms are 


n 1 M 
co 1 — co 0 = -7 dr 
r- 


co 2 1 


— CO 2 



d e 


co\ = 


-co 1 3 



sin 6 dcf> 


co 3 2 = —co 2 3 = cost 


(7.166) 



The curvature two-forms are found from the structure equations to be 


n I 2 M -n 
R° l = R l 0 = —0° AO 1 
r 5 

R° 3 = R 3 o = A§ 3 

r 5 

i , M -i ** 

R l 3 = -R 3 1 = 5-0 1 AO 3 

r 5 


R° 2 = R 2 0 = -^-§° A§ 2 

r i 

/?*2 = -/? 1 = A 0 2 

tf 2 3 = -tf 3 2 = — r 6 2 AQ 3 . 
r i 


(7.167) 


7.9 Differential forms and Hodge theory 
7.9.1 Invariant volume elements 

We have defined the volume element as a non- vanishing m-form on an tri- 
dimensional orientable manifold M in section 5.5. If M is endowed with a 
metric g, there exists a natural volume element which is invariant under coordinate 
transformation. Let us define the invariant volume element by 


Q.M = vTil dx 1 A dx 2 A ... A dx" 


(7.168) 


where g — det and x ,L are the coordinates of the chart ( U , <p). The /«-form 
Q M is, indeed, invariant under a coordinate change. Let y x be the coordinates of 
another chart ( V, i jr) with U IT V 0. The invariant volume element is 


( dx ^ dx v 
det | — — g^v 


dy 1 A ... A dy” 


\ 3 y K dy x 

in terms of the y-coordinates. Noting that dy' 1 " = ( dy x /dx ^) dx /J , this becomes 
/ dx 11 N 


det 


A/TiT det dx 1 A dx 


2 A ... A dx" 


\d y K 

— iyjgjdr 1 A dx 2 A ... A dx'". 

If x ^ and y K define the same orientation, det(3.r /i /3 y K ) is strictly positive on 
U fl V and Q.m is invariant under the coordinate change. 

Exercise 7.20. Let {6 01 } — {e“ M dx M } be the non-coordinate basis. Show that the 
invariant volume element is written as 


£2 m = \e\ dx 1 A dx 2 A ... A dx m = 6 l A § 2 A . . . A 6” 
where e — det e u „ . 


(7.169) 


Now that we have defined the invariant volume element, it is natural to define 
an integration of / e T ( M ) over M by 


[ f£2 M = f fM d* 1 dx 2 . . . dx” 
JM JM 


(7.170) 



Obviously (7.170) is invariant under a change of coordinates. In physics, there 
are many objects which are expressed as volume integrals of this type, see 
section 7.10. 


7.9.2 Duality transformations (Hodge star) 


As noted in section 5.4, £l r (M) is isomorphic to Q' n ~ r (M ) on an m -dimensional 
manifold M. If M is endowed with a metric g, we can define a natural 
isomorphism between them called the Hodge * operation. Define the totally 
anti-symmetric tensor s by 




Note that 


+ 1 if (/x i /rt 2 - • - Mm) is an even permutation of (12 . . . m) 

— 1 if (/u i M 2 • • • Mm ) is an odd permutation of ( 12 ... m) 

0 otherwise. 

(7.171a) 


HUll-Hm _ „H2V2 a flmVm c — o _1 o (1 17 1 

8 —8 8 ■■■8 v 2 ...v m — 8 8 m (./.r/rDj 

The Hodge * is a linear map * : Q.'\M) -* Q m ~ r (M) whose action on a basis 
vector of is defined by 


* (dx Ml A dx^ 2 A ... A dx^ r ) 

a/IH 


i,Atl WI-V-t 


(, m — r)! 


v r + l-v. 


dx Vr+1 A...Adx l ' m . 


(7.172) 


It should be noted that * 1 is the invariant volume element: 


*1 = — 7 -SniM-Hm A ... A dx^" 1 = yj\g\ dx 1 A ... A dx m . 

m ! 

For 

co — —cOfi dx^ 1 A dx M2 A ... A dx^ r e £2' (M) 

we have 


*u> ■ 


vTil 


r\(m — r)! 


-co. 


1*1/* 2-/*r c 


cPll*2-l*r 


If we take the non-coordinate basis {6° 
becomes 


dx Vr+1 A ... Adx Vm . (7.173) 
= {e a ll dx ll \, the * operation 


*(0“‘ A ... A 9 0lr ) = 


1 


(m — r)[ 


ai...a r 


Pr + 1 ■■■A 


§Pr + 1 


A ... A ( 


where 


e «i .«m 


+ 1 if (a i . . . a m ) is an even permutation of (12 ... m) 
— 1 if (ai . . . a m ) is an odd permutation of ( 12 . . . m) 
0 otherwise 


(7.174) 


(7.175) 



and the indices are raised by S a P or rf^ . 

Theorem 7.4. 

**co = (-l) nm ~ r) co. (7.176a) 

if ( M , g) is Riemannian and 

* * co = (-l) 1+r( "- r) ® (7.176b) 


if Lorentzian. 

Proof. It is simpler to prove (7. 176a) with a non-coordinate basis. Let 

CO — —(O O i 1 ...a r 0 Cl1 A ... A 0“ r . 
r! 

Repeated applications of * on co yield 


1 


* * co 


:(o. 


1 a\...a r 


(m — r)! 


Pr+l ■•■Pm 


X —e 

r\ 


p r+ i...p m ^ yr §n a ... a e Yr 


(_ iy(m-r) 

r\r\(m — r)\ 


U> <Xl ..Olr e Cll...ClrP r+ l-P m e Y\-YrPr+l-Pm 


ctPy 


X6 V1 A ... A §^ r 

(_ r \r(m—r ) 

= —— T — co ai ... ar e ai A...Ae a r = (-iy {m ~ r) co 

r\ 

where use has been made of the identity 

S Oll-OlrPr+l -Pm S n-YrPr+l-Pm@ n A • ■ • A §»= r!(m ~ r)!^ 1 A ... A 9°‘ r . 
Py 

The proof of (7. 176b) is left as an exercise to the reader (use det rj — — 1 ). □ 

Thus, we find that (— l)' ( m -') * * ( or (— l) 1 +r(m-r) * ^ j s an identity map 
on Q r ( M ). We define the inverse of * by 

* _1 = (— 1 y (m ~ r ) * (M, g ) is Riemannian (7.177a) 

* _1 = (_i) 1 +A'"-'-) * (M, g) is Lorentzian. (7.177b) 


7.9.3 Inner products of r -forms 

Take 

co — —(Oft dx ,X| A ... A dx^ 
7 = tlx'* 1 A ... A dx /tr . 



The exterior product co A *rj is an m- form: 


co A */? 


1 




pVl-.Vr 


( r ^2 CO Hl~Vr r lvi-Vr (m _ r y° Hr+l-Vm 

x dx w A ... A dx #ir A dx /v+ i A ... A dx 

1 


nE 


^1 ...Hr*! 


V\...V r _ 


fiv 


r\(m - r)\ Vl - Vr ^ r + i - /im 


Tiildx 1 A ... Adx'" 
" l * r a/IsT d-v 1 A ... A dx'". 


This expression shows that the product is symmetric: 


<n a *rj — r) A *&). 

Let {0" } be the non-coordinate basis and 

O) = — O> ai „. a ,.0 W A . . . A § ar 
r ! 

n = ~.riai...a r O ai A... A 0 ar . 
r! 

Equation (7.178) is rewritten as 

CO A *T) = —COa, a r lf l ' 0 ‘ r O l A ... A 0'". 
r ! 

Since a A */3 is an m-form, its integral over M is well defined, 
inner product (co, r /) of two /--forms by 

(co, /;) = J co A *r] 

= —. I d *‘ • ■ - dx'". 

r - JM 

Since co A *i] = ij A *co, the inner product is symmetric, 

(co, rj) = (r), co). 

If (M, g) is Riemannian, the inner product is positive definite, 

(a, a) > 0. 


(7.178) 

(7.179) 


(7.180) 
Define the 

(7.181) 

(7.182) 

(7.183) 


where the equality holds only when a = 0. This is not true if (M, g) is Lorentzian. 



7.9.4 Adjoints of exterior derivatives 

Definition 7.6. Let d : Q. r ~ { (M) -» £2 r (M) be the exterior derivative operator. 
The adjoint exterior derivative operator d^ : Q' ( M ) — > (M) is defined by 

d + = (-l) mr+m+1 * d* (7.184a) 

if (M, g) is Riemannian and 

d f = (-l) mr+m *d* (7.184b) 

if Lorentzian, where m — dim M. 


In summary, we have the following diagram (for a Riemannian manifold). 


Q. m ~ r (M) 
*1 


(-D" 


-> Q" l ~ r+l (M) 


07 (M) 




(7.185) 


-> O r ~ x (M). 


The operator d f is nilpotent since d is: d t2 = *d * *d* oc *d 2 * = 0. 


Theorem 7.5. Let (M, g) be a compact orientable manifold without a boundary 
and a e O'(M), ft e 0'~ l (M). Then 

(dp, a) = (0, dV). (7.186) 


Proof. Since both dp A *a and p A *d*a are m -forms, their integrals over M are 
well defined. Let d act on P A *a, 

d(P A *a) = dp A — (— 1)' P A d * a. 


Suppose (M, g) is Riemannian. Noting that d * ot is an (m — r + l)-form and 
inserting the identity map (— i)( m - r +i)[m-(m-r+l)] * * _ (_ i)mr+m+r+t ^ * j n 
front of d * a in the second term, we have 


d(/l A *a) = dp A — (— 1 y r + m + i p ^ *(*d * a). 

Integrating this equation over M, we have 

I dpA*a-f p A*[(-l) mr+m+1 *d*a] = f d(P A *a) 

Jm Jm Jm 

= / p A *a = 0 
JdM 

where the last equality follows by assumption. This shows that (dp, a) = 
(P, d 1 ^). The reader should check how the proof is modified when (M, g) is 
Lorentzian. □ 



7.9.5 The Laplacian, harmonic forms and the Hodge decomposition 
theorem 

Definition 7.7. The Laplacian A : £3' (M) — > Q r ( M ) is defined by 

A = (d+d t ) 2 = dd t + d t d. (7.187) 


As an example, we obtain the explicit form of A : Q (} ( M ) — >• Q"(M). Let 
/ e jF(M). Since d’ / = 0, we have 

A/ = d f d/ = — * d * (d/if dx M ) 

= - * d f (w ^|), 9 /x/g M £Av 2 ...v m d.r y2 A ... A dx ym ") 

= - *- — 3M/] e Av 2 ...v„, dx v A dx" 2 A ... A dx" m 
{in — 1 )! 

= - *d v [V\8\8 V,1 di.if]g~ l dx * A ... A dx m 

= ~ -^=d v [V\8\8 Vtl ^fl (7.188) 


Exercise 7.21. Take a one-form a> = co^dx^ in the Euclidean space (IR m ,<5). 
Show that 


Aco=-J2 

M=1 


3 2 co v 
dx^dx^ 


dx". 


Example 7.16. In example 5.11, it was shown that half of the Maxwell equations 
are reduced to the identity, d F — d 2 A = 0, where A = ,4 ; , dx M is the vector 
potential one-form and F — dA is the electromagnetic two-form. Let p be the 
electric charge density and j the electric current density and form the current one- 
form j = r], iV j v dx 1 ' = —p dr + j ■ dx. Then the remaining Maxwell equations 
become 

d t F = d t dA = y. (7.189a) 

The component expression is 

3 E 

V-E = p VxB =j. (7.189b) 

3 1 

The vector potential A has a large number of degrees of freedom and we can 
always choose an A which satisfies the Lorentz condition d'A = 0. Then 
(7.189a) becomes (dd^ + d^d)A = AA = j . 


Let (M, g) be a compact Riemannian manifold. The Laplacian A is a 
positive operator on M in the sense that 


(co, A co) = (ft), (d'd + dd')ft;) = (do), dco) + (d^co, d^co) > 0 (7.190) 



where (7.183) has been used. An /'-form o> is called harmonic if A o> = 0 and 
closed (coclosed) if do/ = 0 ( d ; to = 0). The following theorem is a direct 
consequence of (7.190). 

Theorem 7.6. An r- form o> is harmonic if and only if a> is closed and coclosed. 

An r-form o> is called coexact if it is written globally as 

«r=d f p r+l (7.191) 

where /3,. + i e Sl r+l (M) [cf a form ay e £2 r (M) is exact if ay = da r _ i, 
a ,— 1 e £2 r ~ l (M)]. We denote the set of harmonic /--forms on M by Harm' (M) 
and the set of exact /--forms (coexact /--forms) by d£2 r_1 (M) (d^£2 r+1 (M)). 
[Note: The set of exact /--forms has been denoted by B r (M) so far.] 

Theorem 7. 7. (Hodge decomposition theorem) Let ( M.g ) be a compact 
orientable Riemannian manifold without a boundary. Then QT (M) is uniquely 
decomposed as 

Q'\M) = d£2' _1 (M) © d f £2 r+1 (M) 0 Harm'(M). (7.192a) 

[That is, any r-form ay is written globally as 

ay = da,-! + d 1 i + y r (7.192b) 

where a ,- 1 e Q r ~ l (M), fi r+ \ e f2' +1 (M) and y r e Harm' (M).] 

If r = 0, we define = {0}. The proof of this theorem requires the 

results of the following two easy exercises. 

Exercise 7.22. Let (M, g) be as given in theorem 7.7. Show that 

(da,- 1; d + /J r+1 ) = (da,— i , y r ) = (d^+j, y r ) = 0. (7.193) 

Show also that if &>,• e Q''(M) satisfies 

(da,— i , a> r ) = (d T yS,. +1 , co r ) = (y r , co r ) = 0 (7.194) 

for any da,— j e d £T' -1 (M), d^^r+i e d’^'' +1 (M) and /,■ e Harm r (M), then 
co r = 0. 

Exercise 7.23. Suppose a> r e Q r (M) is written as ay = Ai/r,- for some xj/ r e 
Q r (M). Show that (ay, y r ) — 0 for any y r e Harm' (M). The proof of the 
converse ‘if ay is orthogonal to any harmonic r-form, then ay is written as A///> 
for some xfi y e £Z r (M)’ is highly technical and we just state that the operator A -1 
(the Green function) is well defined in the present problem and r/r,- is given by 
A _1 ay. 

Let P : Q' (M) -» Harm' (M) be a projection operator to the space of 
harmonic r-forms. Take an element ay e Q' (M ). Since ay — Pay is orthogonal 
to Harm' (M), it can be written as Ai// r for some i /r,. e £2 r (M). Then we have 

ay = d(dVr) + d f (di/v) + Poo r . (7.195) 

This realizes the decomposition of theorem 7.7. 



7.9.6 Harmonic forms and de Rham cohomology groups 


We show that any element of the de Rham cohomology group has a unique 
harmonic representative. Let [ ay ] e H r (M). We first show that ay e 
Harm r (M) © df2' _1 (M). According to (7.192b), ay is decomposed as ay — 
y r + doy _ 1 + d * p r + 1 . Since day = 0, we have 


0 = (day, Pr+ 1 ) = (dd*Pr+U Pr+ 1) = (d t /8 r+ l, d' Pr+\)- 


This is satisfied if and only if d' /l, +i = 0. Hence, ay — y r + da r -i. From (7.195) 
we have 

ay — Pco r + d(d^) = Pay + dd ' A _1 ay. (7.196a) 

Yr = Pco, is the harmonic representative of [o> r ]. Let ay be another representative 
of [ay]: ay — ay — drj r -i, rj,- 1 e f2'' -1 (M). Corresponding to (7.196a), we have 

co r — P (Or + d(d’ A~ l co r ) — P ay + d(. . .) (7.196b) 

where the last equality follows since dry _i is orthogonal to Harm' (M) and hence 
its projection to Harm ' (M) vanishes. (7.196a) and (7.196b) show that [ay] has a 
unique harmonic representative Pay . 

This proof shows that H'\M) C Harm' (M). Now we prove that H' (M ) D 
Harm' (M). Since dy r — 0 for any y r e Harm' (M), we find that Z' (M) D 
Harm' (M). We also have B'\M) D Harm' (M) = 0 since B'\M) = d Q r ~ l (M), 
see (7.192a). Thus, every element of Harm' (M) is a non-trivial member of 
H' {M ) and we find that Harm ' (M) is a vector subspace of H r (M) and hence 
Harm ' (M) C H r (M). We have proved: 

Theorem 7.8. (Hodge’s theorem) On a compact orientable Riemannian manifold 
(M, g), H'\M ) is isomorphic to Harm' (M): 

H r (M) = Harm' (M). (7.197) 

The isomorphism is provided by identifying [<w] e H' (M) with Pco e 
Harm r (M). 

In particular, we have 

dim Harm'' (M) = dim H’\M) = b r (7.198) 

b r being the Betti number. The Euler characteristic is given by 

X(M) = ^(-1 ) r h r = ^(-l) r dim Harm" (M) (7.199) 

see theorem 3.7. We note that the LHS is a topological quantity while the RHS is 
an analytical quantity given by the eigenvalue problem of the Laplacian A. 



7.10 Aspects of general relativity 
7.10.1 Introduction to general relativity 

The general theory of relativity is one of the most beautiful and successful 
theories in classical physics. There is no disagreement between the theory 
and astrophysical and cosmological observations such as solar system tests, 
gravitational radiation from pulsars, gravitational red shifts, the recently 
discovered gravitational lens effect and so on. Readers not very familiar with 
general relativity may consult Berry (1989) or the primer by Price (1982). 

Einstein proposed the following principles to construct the general theory of 
relativity 

(I) Principle of General Relativity: All laws in physics take the same forms in 
any coordinate system. 

(II) Principle of Equivalence: There exists a coordinate system in which the 
effect of a gravitational field vanishes locally. (An observer in a freely falling 
lift does not feel gravity until it crashes.) 

Any theory of gravity must reduce to Newton’s theory of gravity in the weak- 
field limit. In Newton’s theory, the gravitational potential <t> satisfies the Poisson 
equation 

AO = \nGp (7.200) 

where p is the mass density. The Einstein equation generalizes this classical result 
so that the principle of general relativity is satisfied. 

In general relativity, the gravitational potential is replaced by the components 
of the metric tensor. Then, instead of the LHS of (7.200), we have the Einstein 
tensor defined by 

G^ v = Ric^ - jg^yTl. (7.201) 

Similarly, the mass density is replaced by a more general object called the 

energy-momentum tensor 7 ,, . The Einstein equation takes a very similar 
form to (7.200): 

G^y = SnGT^. (7.202) 

The constant 8jtG is chosen so that (7.202) reproduces the Newtonian result in 
the weak-field limit. The tensor 7’ /n , is obtained from the matter action by the 
variational principle. From Noether’s theorem, T /n must satisfy a conservation 
equation of the form V /f T 1 ' 1 ’ — 0. A similar conservation law holds for G //l; (but 
not for Ric^y). We shall see in the next subsection that the LHS of (7.202) is also 
obtained from the variational principle. 

Exercise 7.24. Consider a metric 
2$ 

goo = — 1 ~ 

c- 


goi — 0 gij = &ij 


1 < i, j < 3 



and given by Too — pc 2 , 7b/ = Tij = 0 which corresponds to dust at 
rest. Show that (7.202) reduces to the Poisson equation in the weak-field limit 
(<J> /c 2 « 1). 

7.10.2 Einstein-Hilbert action 

This and the next example are taken from Weinberg (1972). The general theory 
of relativity describes the dynamics of the geometry, that is, the dynamics of 
g llv . What is the action principle for this theory? As usual, we require that the 
relevant action should be a scalar. Moreover, it should contain the derivatives of 
g^y : f vTiT d m x cannot describe the dynamics of the metric. The simplest guess 
will be Seh oc f 7l^/\g\ d m x. Since 1Z is a scalar and v^IiT dx 1 dx 2 . . . dx m is 
the invariant volume element, Seh is a scalar. In the following, we show that 
Seh indeed yields the Einstein equation under the variation with respect to the 
metric. Our connection is restricted to the Levi-Civita connection. We first prove 
a technical proposition. 

Proposition 7.2. Let (M, g) be a (pseudo-)Riemannian manifold. Under the 
variation g^ v -» g^ v + Sg^y, g ^ v , g and Ric llv change as 

(a) Sg^ = -8^8 Xv Sg K x (7.203) 

(b) Sg = gg^Sg^, S^\i\ = ^VlilS^Sg^ (7.204) 

(c) SRic^y = V K Sr K vl , - VySr K Kll (Palatini identity). (7.205) 

Proof, (a) From g K xg Xv = S K V , it follows that 

0 = S(g K xg Xv ) = Sg K x8 Xv + g K xtg Xv . 

Multiplying by g llK we find that Sg^ v = — g^ K g Xv Sg K x- 

(b) We first note the matrix identity ^(detg^y) = tr(ln g /; „). This can be 
proved by diagonalizing g /lv . Under the variation Sg^y , the LHS becomes Sg ■ g ~ 1 
while the RHS yields g^ v ■ Sg^ v , hence Sg = gg^Sg^y. The rest of (7.204) is 
easily derived from this. 

(c) Let T and T be two connections. The difference ST = T — T is a tensor 
of type (1,2), see exercise 7.5. In the present case, we take T to be a connection 
associated with g + Sg and T with g. We will work in the normal coordinate 
system in which F = 0 (of course 3 F ^ 0 in general); see section 7.4. We find 

SRic^y = - 3v«rv = v^rv - v„rV- 

[The reader should verify the second equality.] Since both sides are tensors, this 
is valid in any coordinate system. □ 

We define the Einstein-Hilbert action by 

Seh = J TZV~8 d 4 x . (7.206) 



The constant factor 1/16 ttG is introduced to reproduce the Newtonian limit when 
matter is added; see (7.214). We prove that S Seh = Oleads to the vacuum Einstein 
equation. Under the variation g — > g + Sg such that Sg -> 0 as |x| —*■ 0, the 
integrand changes as 

sazv^g) = sig^Ric^y/^g) 

= Sg^ RiC^y/^g + g^SRiC^y/^g + H8(V=g) 

= - g^g^SgKxRiCfivy/^g 

+ ^’(V^rv - v„rV)V=? + ^nV^g^Sg^. 

We note that the second term is a total divergence, 

= W v & rvV=£) - 

and hence does not contribute to the variation. From the remaining terms we have 

SSeh = f (-Ric» v + \ng^ S gflv V^d 4 x. (7.207) 

If we require that S .S'hi i = 0 under any variation Sg, we obtain the vacuum Einstein 
equation, 

G„ v = Ric^v - {g, lv n = 0 (7.208) 

where the symmetric tensor G is called the Einstein tensor. 

So far we have considered the gravitational field only. Suppose there exists 
matter described by an action 

S M = J m)yf=g d 4 * (7.209) 

where £(</>) is the Lagrangian density of the theory. Typical examples are the real 
scalar field and the Maxwell fields, 

S s = ~?f [g^drfW + m 2 0 2 ]V=g d 4 x (7.210a) 

Sed = ~\ J F^y/^g d 4 x (7.210b) 

where F jlv — d^A v — 3„ A M = V /x A „ — V„ A /x . If the matter action changes by 
S S'm under Sg, the energy-momentum tensor T /iv is defined by 

SS M = jJ T» v 8 gllV y/=g d 4 x. 


(7.211) 



Since Sg ^ is symmetric, T 1 ' 1 ’ is also taken to be so. For example, T llv of a real 
scalar field is given by 


Tav(x) = 2 


-Ss 


V~8 8g^ v (x) 

— 3/i03v0 - \ g^v(g Kl 3^03x0 + m 2 (jr). 


(7.212) 


Suppose we have a gravitational field coupled with a matter field whose 
action is 5 m- Now our action principle is 


3(5eh + <Sm) = 0 


(7.213) 


under g -* g + Sg. From (7.207) and (7.21 1), we obtain the Einstein equation 

G llv = 8jtGT iiv . (7.214) 

Exercise 7.25. We may add an extra scalar to the scalar curvature without spoiling 
the invariance of the action. For example, we can add a constant called the 

cosmological constant A, 

Seh = TT~~p f CR+A)V=gd 4 x. (7.215) 

1o7tG Jm 

Write down the vacuum Einstein equation. Other possible scalars may be such 
terms as 1Z 2 , Ric 111 ’ Ric^ v or R K Xnv R KXtlv ■ 


7.10.3 Spinors in curved spacetime 

For concreteness, we consider a Dirac spinor ^ in a four-dimensional Lorentz 
manifold M. The vierbein e" /; defined by 

8 l cv = e a fl e^ v r la p (7.216) 

defines an orthonormal frame {9 01 = e a ll dx ll \ at each point p e M. As noted 
before, a, ft, y, . . . are the local orthonormal indices while p., v, X, . . . are the 
coordinate indices. With respect to this frame, the Dirac matrices y a = e u lt y 1 ' 
satisfy [y a , y^j = 2 ifP. Under a local Lorentz transformation A 01 p (p), the 
Dirac spinor transforms as 

f{p) -> P(A)xKp) f{p) -> *{p)p{ A)- 1 (7.217) 

where ijr = i j/ 4 y° and p( A) is the spinor representation of A. To construct an 
invariant action, we seek a covariant derivative V„ ij/ which is a local Lorentz 
vector and transforms as a spinor. 


-> p(A)AjVpx[r. 


(7.218) 



If we find such a V ff t/r, an invariant Lagrangian may be given by 


£ = f (iy“V a + m) f (7.219) 

m being the mass of i fr. We note that e a II d ll ij/ transforms under A(p) as 

-> A«%%p(A)tfr = A„%^[p(A)9 ^ + 9„p(A )tfr]. (7.220) 
Suppose Vo, is of the form 

= e«/[9„ + £VhA. (7.221) 

From (7.218) and (7.220), we find that f 2 ^ satisfies 

p(A)fi /i p(A)“ 1 - 9 /x p(A)p(A) _1 . (7.222) 

To find the explicit form of Q lt , we consider an infinitesimal local Lorentz 
transformation A a ^(p) = S a & + s a ^(p). The Dirac spinor transforms as 

* -> exp[iie^S^]tA ~ [1 + ±i e^-ZapW (7.223) 

where E a p = ^i[y„, y^] is the spinor representation of the generators of the 
Lorentz transformation. E a p satisfies the o(l, 3) Lie algebra 

i [Llo'/t, Ej/,5] — PyP^o/S PyaEtps "H PspEy a PSaEyp. (7.224) 

Under the same Lorentz transformation, Q /t transforms as 

^ (1 + iie“^)£Vl - ±i e yS E yS ) - iia M e^E ttjg (l - \is y& E Y& ) 

= + ±ie^[E^, Q /( ] - (7.225) 

We recall that the connection one-form of p transforms under an infinitesimal 
Lorentz transformation as (see (7.152)) 

of p — > of p + e a yofp — of Y s v p — de a p (7.226a) 


or in components, 

r%/j -> r“^ + £°>r V - r\y S Yp - d, L e a p. (7.226b) 

From (7.224), (7.225) and (7.226b), we find that the combination 

^ = ±ir a ^E af ) = e a v V lt eP v E a p (7.227) 

satisfies the transformation property (7.222). In fact, 

iir“/E^ -* ±i( F“/ + e a Y TY/ - r% K e^ - d^Eap 
= ?ir a /E a p + ji(s a y r y /E a p - r% y e yfi E a p) 

- ±i9 ^e aP E a p 

= |ir a /E a p + ±i s aP [E a p, ±ir %%«] - ±i d^Eap. 



We finally obtain the Lagrangian which is a scalar both under coordinate 
changes and local Lorentz rotations, 

£ = ^[i K V(9„ + £ir V *Pv) + (7-228) 

and the scalar action 

s f =[ d 4 xV^^[iyV(a M + iir + ( 7 . 229 a) 

Jm 

If 1 1/ is coupled to the gauge field A, the action is given by 

S f =[ A A x^^Y a e a ,l {d^+A ll + \iTP l /i:p Y ) + m]^. (7.229b) 

Jm 

It is interesting to note that the spin connection term vanishes if dim M = 2. 
To see this, we rewrite (7.229a) as 

S f = 2 f + jirVfij''*. Vp Y } + m]1r (7.229a') 

" Jm 

where = y a e a fl and we have added total derivatives to the Lagrangian to 
make it Hermitian. The non- vanishing components of E are Eoi oc [yo, y\ ] oc yj, 
where yj is the two-dimensional analogue of y$. Since {y^, 73 } = 0, the spin 
connection term drops out from S,i, . 

7.11 Bosonic string theory 

Quantum held theory (QFT) is occasionally called particle physics since it deals 
with the dynamics of particles. As far as high-energy processes whose typical 
energy is much smaller than the Planck energy (~10 19 GeV) are concerned there 
is no objection to this viewpoint. However, once we try to quantize gravity in 
this framework, there exists an impenetrable barrier. We do not know how to 
renormalize the ultraviolet divergences that are ubiquitous in the QFT of gravity. 
In the early 1980s, physicists tried to construct a consistent theory of gravity 
by introducing supersymmetry. In spite of a partial improvement, the resulting 
supergravity could not tame the ultraviolet behaviour completely. 

In the late 1960s and early 1970s, the dual resonance model was extensively 
studied as a candidate for a model of hadrons. In this, particles are replaced 
by one-dimensional objects called strings. Unfortunately, it turned out that 
the theory contained tachyons (imaginary mass particles) and spin-2 particles 
and, moreover, it is consistent only in 26-dimensional spacetime! Due to 
these difficulties, the theory was abandoned and taken over by quantum 
chromodynamics (QCD). However, a small number of people noticed that the 
theory must contain the graviton and they thought it could be a candidate for the 
quantum theory of gravity. 




Figure 7.9. The trajectories of an open string ( a ) and a closed string ( b ). Slices of the 
trajectories at fixed parameter tq are also shown. 

Nowadays, supersymmetry has been built into string theory to form the 
superstring theory, which is free of tachyons and consistent in ten-dimensional 
spacetime. There are several candidates for consistent superstring theories. It is 
sometimes suggested that complete mathematical consistency will single out a 
unique theory of everything (TOE). 

In this book, we study the elementary aspects of bosonic string theory in the 
final chapter. We also study some mathematical tools relevant for superstrings. 
The classical review is that of Scherk (1975). We give more references in 
chapter 14. 

7.11.1 The string action 

The trajectory of a particle in a D-dimensional Minkowski spacetime is given by 
the set of D functions X 1 ' (r), 1 < // < D, where r parametrizes the trajectory. 
A string is a one-dimensional object and its configuration is parametrized by two 
numbers (er, r), a being spacelike and r timelike. Its position in D-dimensional 
Minkowski spacetime is given by X^io, r), see figure 7.9. The parameter a can 
be normalized as a e [0, tt ] . A string may be open or closed. We now seek an 
action that governs the dynamics of strings. 

We first note that the action of a relativistic particle is the length of the world 

line , 



(7.230) 


where X 1 ' = dW'/dr. For some purposes, it is convenient to take another 
expression, 



(7.231) 


where the auxiliary variable g = g TT is regarded as a metric. 


Exercise 7.26. Write down the Euler-Lagrange equations derived from (7.231). 
Eliminate g from (7.231) making use of the equation of motion to reproduce 

(7.230) . 

What is the advantage of (7.231) over (7.230)? We first note that (7.231) 
makes sense even when mr = 0, while (7.230) vanishes in this case. Second, 

(7.231) is quadratic in X while the A'-dependence of (7.230) is rather complicated. 
Nambu (1970) proposed an action describing the strings, which is 

proportional to the area of the world sheet, the surface spanned by the trajectory 
of a string. Clearly this is a generalization of the length of the world line of a 
particle. He proposed the Nambu action, 

5 = ~2na' Jo d(7 / ' dT [ ~ det ( 9 «*%^)] 1/2 (7-232) 

where §° = r, f 1 — a and d a X ^ = dX^/d The parameter r; (tf) is the 
initial (final) value of the parameter r while a' is a parameter corresponding to 
the inverse string tension (the Regge slope). 

Exercise 7.27. The action S is required to have no dimension. We take a and r to 
be dimensionless. Show that the dimension of a' is [length] 2 . 

Although the action provides a nice geometrical picture, it is not quadratic 
in X and it turned out that the quantization of the theory was rather difficult. Let 
us seek an equivalent action which is easier to quantize. We proceed analogously 
to the case of point particles. A quadratic action for strings is called the Polyakov 
action (Polyakov 1981) and is given by 

S = ~4 Jo da J ' ^^SS^daX^dpX^ (7.233) 

where g — det g a p and g a P = (g -1 )“^. If the string is open, the trajectory is 

a sheet while if it is closed, it is a tube, see figure 7.9. It is shown here that the 

action (7.233) agrees with (7.232) upon eliminating g. It should be noted though 
that this is true only for the Lagrangian. There is no guarantee that this remains 
true at the quantum level. It has been shown that the quantum theory based on the 
respective Lagrangians agrees only for D = 26. The action (7.233) is invariant 
under 

(i) local reparametrization of the world sheet 

t —*■ x'{x, a) a->a'(r,a) (7.234a) 

(ii) Weyl rescaling 

gap^g'ap^e+Mgap (7.234b) 


(iii) global Poincare invariance 

X 11 -* X^' = A^ v r + fl" A e SO(D — 1, 1) a e (7.234c) 
These symmetries will be worked out later. 

Exercise 7.28. Taking advantage of symmetries (i) and (iii), it is always possible 
to choose g a p in the form g a p — i] a p . Write down the equation of motion for X 11 
to show that it obeys the equation 

rj^dadpX^ 1 = 0. (7.235) 


7.11.2 Symmetries of the Polyakov strings 

The bosonic string theory is defined on a two-dimensional Lorentz manifold 
(M, g ). The embedding / : M — > is defined by £“ m* X ^ where 
{£" ) = (r, er) are the local coordinates of M. We assume the physical spacetime 
is Minkowskian (IT 0 , rj) for simplicity. The Polyakov action 

S = ~kj V=gg all a a X' t dpX v r hlv (7.236) 

is left invariant under the coordinate reparametrization Diff(M) since the volume 
element ^/—gd 2 ^ is invariant and g a ^ 8 a X 11 dp X is a scalar. 

Now we are ready to derive the equation of motion. Our variational 
parameters are the embedding X 11 and the geometry g a p. Under the variation 
SX /i , we have the Euler-Lagrange equation 

da(V^88 ap dpX^ = 0. (7.237a) 

Under the variation 8 g a p, the integrand of S changes as 

siV^g^daX^dpX^) = &v=gg a (>d a x»apx ll + y/^gSg^daXVdpXn 
= - kV^ggysSg^g^d^dpX,, 

+ V=8Sg a f t a a x' l dpX ll 

where proposition 7.2 has been used. Since this should vanish for any variation 
Sg a p, we should have 

T a p = dcX^dpX,, - y a p(g yS d Y X^dsX^) = 0. (7.237b) 

This is solved for g a p to yield 

gap = daX^d pX v V „ v (7.238) 

showing that the induced metric (the RHS) agrees with g a p. Substituting (7.238) 
into (7.236) to eliminate g a p , we recover the Nambu action, 


S = ~\j d2 ? 7-del (dcX^dpX^). 


(7.239) 



By construction, the action S is invariant under local reparametrization of 
M, {£“} — »• {£'“(£)}. In addition to this, the action has extra invariances. Under 
the global Poincare transformation in Z)-dimensional spacetime, 

-> x ,fl = A» V X V + a 11 (7.240) 

the action S transforms as 

d 2 | V ZI 88 aP d a (A^ K X K + a fl )dp(A\X x + a v )r, llv 

= ~kf ^V z: 88 afi daX K df i X x (Ai 1 K A v ^ l , v ). 

From A v A»7 l ttv = >7 kA, we find that S is invariant under global Poincare 
transformations. The action S is also invariant under the Weyl rescaling, 
Safi (r, a) -» e 2cr(r ' a) g a ,p ( L cr) keeping (r, a) fixed. In fact, S transforms as 

S — ► J d 2 if —e, 4a geA 2a 9 „ X 11 d p X" r hlv 

and hence is left invariant. Note that the Weyl rescaling invariance exists only 
when M is two dimensional, making strings prominent among other extended 
objects such as membranes. 

Since dimM = 2, we can always parametrize the world sheet by the 
isothermal coordinate (example 7.9) so that 

gap =e 2or(T ' cr) n aP . (7.241) 

Then the Weyl rescaling invariance allows us to choose the standard metric ij a p 
on the world sheet. The metric g a p has three independent components while the 
reparametrization has two degrees of freedom and the Weyl scaling invariance 
has one. Thus, so long as we are dealing with strings, we can choose the standard 
metric rj a p. 

We end our analysis of Polyakov strings here. Polyakov strings will be 
quantized in the most elegant manner in chapter 14. 

Exercise 7.29. Let (M, g) and (N, h) be Riemannian manifolds. Take a chart U 
of M in which the metric g takes the form 

g = g/tvC*) ® d.v y . 

Take a chart V of N on which h takes the form 

h = G a p (4>)d(j) a <g> d </A 

A map 4> : M —*■ N defined by x (f> (x ) is called a harmonic map if it satisfies 


Show that this equation is obtained by the variation of the action 


S=jJ d m x (7.243) 

with respect to <f>. Applications of harmonic maps to physics are found in Misner 
(1978) and Sanchez (1988). Mathematical aspects have been reviewed in Eells 
and Lemaire (1968). 

Problems 

7.1 Let V be a general connection for which the torsion tensor does not vanish. 
Show that the first Bianchi identity becomes 

©{/?(*, Y)Z) = 6 {T(X, [Y, Z])} + &{V X [T(Y, Z)]} 

where & is the symmetrizer defined in theorem 7.2. Show also that the second 
Bianchi identity is given by 

6 {(V X R)(Y, Z)}V = 6 {R(X, T(Y, Z))}V 

where & symmetrizes X , Y and Z only. 

7.2 Let (M, g) be a conformally flat three-dimensional manifold. Show that the 
Weyl-Schouten tensor defined by 

Cxjiv — V I, Ri exit, V,, Rjc, A 4 (gk/i9 v TZ gxvd^'R,) 

vanishes. It is known that = 0 is the necessary and sufficient condition for 
conformal flatness if dimM = 3. 

7.3 Consider a metric 

g — —dr <g> dr + dr <g> dr + (1 — 4 fi 2 )r 2 d(f> <g) dcp + d; <g) dz 
where 0 < [x < 1/2 and /i ^ 1/4. Introduce a new variable 

cp = {l 

and show that the metric g reduces to the Minkowski metric. Does this mean that 
g describes Minkowski spacetime? Compute the Riemann curvature tensor and 
show that there is a stringlike singularity at r = 0. This singularity is conical (the 
spacetime is flat except along the line). This metric models the spacetime of a 
cosmic string. 



8 


COMPLEX MANIFOLDS 


A differentiable manifold is a topological space which admits differentiable 
structures. Here we introduce another structure which has relevance in physics. 
In elementary complex analysis, the partial derivatives are required to satisfy the 
Cauchy-Riemann relations. We talk not only of the differentiability but also of 
the analyticity of a function in this case. A complex manifold admits a complex 
structure in which each coordinate neighbourhood is homeomorphic to O" and 
the transition from one coordinate system to the other is analytic. 

The reader may consult Chern (1979), Goldberg (1962) or Greene (1987) 
for further details. Griffiths and Harris (1978), chapter 0 is a concise survey of 
the present topics. For applications to physics, see Horowitz (1986) and Candelas 
(1988). 

8.1 Complex manifolds 

To begin with, we define a holomorphic (or analytic) map on C " . A complex- 
valued function / : C”' — > C is holomorphic if / = f\ + i fi satisfies the 
Cauchy-Riemann relations for each zJ' = x 1 ’ + i v ;/ , 

_9/i_ _ _9/2_ (81) 

dx ^ 3 dx P dy ^ 

A map (Z 1 , ...,/") : C" — > C" is called holomorphic if each function /'■ 
(1 < X < n) is holomorphic. 

8.1.1 Definitions 

Definition 8.1. M is a complex manifold if the following axioms hold, 

(i) M is a topological space. 

(ii) M is provided with a family of pairs { ([/,• , <Pi)}. 

(iii) {U,} is a family of open sets which covers M. The map q>, is a 
homeomorphism from iJ t to an open subset U of C”' . [Hence, M is even 
dimensional.] 

(iv) Given U\ and U j such that [/,• fl Uj 0, the map f/jj = tpj o (p~ l from 
<Pi(Ui fl Uj) to (fij ( Uj fl Uj) is holomorphic. 



The number m is called the complex dimension of M and is denoted as 
dime M = in. The real dimension 2m is denoted either by dim;:: M or simply 
by dimM. Let z M = <pi(p) and w v — (Pj{p ) be the (complex) coordinates of 
a point p e Uj D Uj in the charts ( Ui,<pi ) and ( Uj,(pj ), respectively. Axiom 
(iv) asserts that the function w v = it 11 + ii> v (1 < v < m ) is holomorphic in 
zj l = x ,A + i y^, namely 

du v dv v du v dv v 

= = 1 < /x, v < m. 

dx v 3 y v 3 y v dx v 

These axioms ensure that calculus on complex manifolds can be carried out 
independently of the special coordinates chosen. For example, C m is the simplest 
complex manifold. A single chart covers the whole space and tp is the identity 
map. 

Let {{Ui, </?/)} and {(Vj, i j/j)} be atlases of M. If the union of two atlases is 
again an atlas which satisfies the axioms of definition 8.1, they are said to define 
the same complex structure. A complex manifold may carry a number of complex 
structures (see example 8.2). 


8.1.2 Examples 

Example 8.1. In exercise 5.1, it was shown that the stereographic coordinates of 
a point P(x, y. z) e S 2 — {North Pole} projected from the North Pole are 


(X, Y ) = 



while those of a point P(x, y. z) e S 2 — {South Pole} projected from the South 
Pole are 


(U, V) = 


-y 


l + z i + z 


[Note the orientation of ((/, V) in figure 5.5.] Let us define complex coordinates 


Z = X + iY, Z = X-iY, W = U + iV, W=U -iV. 


IT is a holomorphic function of Z, 


W = 


x — iy 

1 + z 


1 - z 
1 + z 


(X - i F) 


X-i Y 
X 2 + Y 2 


1 

Z 


Thus, S 2 is a complex manifold which is identified with the Riemann sphere 
C U {oo}. 


Example 8.2. Take a complex plane C and define a lattice L(a >\ , C 02 ) = {co\m + 
a> 2 n\m, n e Z] where o>\ and an are two non- vanishing complex numbers such 



Figure 8.1. Two complex numbers u>\ and a>2 define a lattice L{a >\ , idt) in the complex 
plane. C/L(a >\ , uit) is homeomorphic to the torus (the shaded area). 


that 022/021 £ IK; see figure 8 . 1 . Without loss of generality, we may take 
Im(&>2/<wi) > 0 . The manifold C/L(&>i , 02 2) is obtained by identifying the points 
zi e C such that zi — zi — 021m + a>2n for some m,n e Z. Since the 
opposite sides of the shaded area of figure 8.1 are identified, C/L(a>i,&>2) is 
homeomorphic to the torus T 2 . The complex structure of C naturally induces 
that of C/L(&>i , o>2). We say that the pair (&>i , 022) defines a complex structure on 
T 2 . There are many pairs (a> \ , 022) which give the same complex structure on T 2 . 

When do pairs (<22 \ , 022) and (Im(o)2/tt>i) > 0 , ImOw^/wj) > 0 ) 

define the same complex structure? We first note that two lattices L(coi , 022) and 
L(&/j , 02' 2 ) coincide if and only if there exists a matrix 1 

^ e PSL( 2 , Z) = SL( 2 , Z)/Z 2 


( a b 
c d 


such that 

( w 'i \ _ ( a b \ ( \ 

{02' d J \ 022 J ' 

This statement is proved as follows. 

Suppose 


( 


02 , 


co ; 


a b \ / u>\ \ 

c d ) \ 022 ) 


where 


a b 
c d 


e SL( 2 , Z). 


The group SL( 2 , Z) has been defined in ( 2 . 4 ). Two matrices A and — A are identified in PSL( 2 , Z). 


1 


Since m[, m' 2 e L(m i, m2 ), we find L(®p oil,) C L(&>i, &>2)- From 

/ m \ = ( d -b \ / m\ \ 

\ 0> 2 ) \ —c a )\ m' 2 ) 

we also find L(m i, m2) C m 2 ). Thus, L(a>i, m 2) = L(m' x , &/,). Conversely, 

if L(<wi, (W2) = &>(,), m[ and mi, are lattice points of L(m 1, W2) and can be 

written as &>' j = r/cui + cm 2 and m' 2 — bm\ + am2 where a, b,c, d e Z. Also 
m\ and m2 may be expressed as m\ = d' m[ + c'm' n and m2 = b’m' x + a'm' 2 where 
a\ b' , c' , d' eh. Then we have 


( 


m 1 
m2 




a ' \ ( a b 

c' d! )\c d 


m\ 

m2 


from which we find 


( a' 

b ’ \ 

( a 

b 

\ c' 

d' j 

l c 

d 


1 

0 


0 

1 


Equating the determinants of both sides, we have [a! d' — b'c')(ad — be) = 1 . All 
the entries being integers, this is possible only when ad — be — ± 1 . Since 

( bm\ + aa>2\ ad — be ^ /&4 \ 

dm\ + cm2 ) |c(tt>2/tt»i) + d \ 2 \m[J 

we must have ad — be > 0 , that is. 



(“ J)^L( 2 ,Z). 

In fact, it is clear that 

(c ^ e S L ( 2 , Z) 

defines the same lattice as 

( a b 
c d 

and we have to identify those matrices of SL( 2 , Z) which differ only by their 
overall signature. Thus, two lattices agree if they are related by PSL( 2 , Z) = 
SL( 2 , Z)/Z 2 . 

Assume that there exists a one-to-one holomorphic map h of C/ L(o>\ , m2) 
onto C/L(d>i , m2) where Im (m2/m\) > 0 , hn(m2/m\) > 0 . Let p : C — > 
C/L(m \ , m2) and p : C — >• C/L(m 1 , m2) be the natural projections. For example, 
p maps a point in C to an equivalent point in C/L{m 1 , ®2). Choose the origin 0 
and define /z*( 0 ) to be a point such that p o /?*( 0 ) = h o p( 0 ) (figure 8 . 2 ), 

C > C 

p{ [p ( 8 . 3 ) 

C/L(m \ , m2) > C/L(d>i , 0)2). 





Figure 8.2. A holomorphic bijection h : C/ L(o >\ , m2) — >■ C/L(m\ , m2) and the natural 
projections p : C -*■ C/ L(m\ , m2), p : C — >■ C/L(m\ , m2) define a holomorphic bijection 
/1* : C C. 


Then by analytic continuation from the origin, we obtain a one-to-one 
holomorphic map /z* of C onto itself satisfying 

p o /z*(z) = h o p(z ) for all z e C ( 8 . 4 ) 

so that the diagram ( 8 . 3 ) commutes. It is known that a one-to-one holomorphic 
map of C onto itself must be of the form z — > A*(z) = az + b, where a, b e C 
and a ^ 0. We then have h*(co 1) — /t*(0) = awi and /z*(to 2) — /t*( 0 ) = at«2- 
For h to be well defined as a map of C/L(o> 1 , <02) onto C/L(<y 1 , 022), we must 
have ao>i, a«2 e L(w\, C02), see figure 8 . 2 . By changing the roles of (o>i, u> 2) 
and (ojj , a>2), we have ncwi , aa>\ e L(<ui , m2) where o ^ 0 is a complex number. 
Hence, we conclude that if C/L(a>i , m2), C/L(d> 1 , an) have the same complex 
structure, there must be a matrix M e SL( 2 , Z) and a complex number X (=d -1 ) 
such that 



Conversely, we verify that (co 1 , m2) and (a >[ , m^) related by ( 8 . 5 ) define the same 
complex structure. In fact, 


mi 

m2 


and 





Figure 8.3. The quotient space H / PSL(2, Z). 


define the same lattice (modulo translation) and we may take h * : C — > C to he 
z i->- z + b. L(a>\, a> 2 ) and L(Xco \ , Xcoj) also define the same complex structure. 
We take, in this case, /?* : z >->• Xz + b. 

We have shown that the complex structure on T 2 is defined by a pair of 
complex numbers (&q , <02 ) modulo a constant factor and PSL(2, Z). To get rid of 
the constant factor, we introduce the modular parameter r = 002 / ( 0 \ e H = {z e 
C| Im z > 0|, to specify the complex structure of T 2 . Without loss of generality, 
we take 1 and r to be the generators of a lattice. Note, however, that not all of 
r e H are independent modular parameters. As was shown previously, r and 
x' — (ax + b)/(cx + d) define the same complex structure if 

(° 2 ) 6PSL<2 ' Z) - 

The quotient space H/PSL(2, Z) is shown in figure 8.3, the derivation of which 
can be found in Koblitz (1984) p 100, and Gunning (1962) p 4. 

The change r —*■ x' is called the modular transformation and is generated 
by r — »■ r + 1 and r —*■ — 1/r. The transformation r —*■ x + 1 generates a 
Dehn twist along the meridian m as follows (figure 8.4(a)). (i) First, cut a torus 
along m . (ii) Then take one of the lips of the cut and rotate it by 2n with the other 
lip kept fixed, (iii) Then glue the lips together again. The other transformation 
r —> — 1/r corresponds to changing the roles of the longitude / and the meridian 
m (figure 8.4(b)). 

Example 8.3. The complex projective space CP" is defined similarly to WP" ; 
see example 5.4. The ntuple z = (z°, . . . , z") e C" +1 determines a complex 
line through the origin provided that z ^ 0. Define an equivalence relation 


(a) lb) 



Figure 8.4. (a) Dehn twists generate modular transformations, (b) x — > — 1/r changes 
the roles of l and m. 

by z ~ w if there exists a complex number a ^ 0 such that w — az. 
Then CP" = (C !+1 - {0})/ ~. The (n + 1) numbers z°, z 1 , . ■ . , z" are 
called the homogeneous coordinates, which is denoted by [z , z 1 , . . . , z n ] where 
(z°, . . . , z n ) is identified with (Az 0 , . . . , Az") (A ^ 0). A chart U is a subset of 
C" +1 — {0} such that z 1 ' ^ 0. In a chart G /x , the inhomogeneous coordinates are 
defined by i-Y-. = z v /z^ (v ^ p). In fl U v ^ 0, the coordinate transformation 
Vv = C" -> C is 

Accordingly, t is a multiplication by z v /z /i , which is, of course, holomorphic. 

Example 8.4. The complex Grassmann manifolds Gk.n (C) are defined similarly 
to the real Grassmann manifolds; see example 5.5. Gk.n (C) is the set of complex 
A'-dimensional subspaces of C" . Note that CP" = Gi jH +i(Q. 

Let Mk.n (C) be the set of k x n matrices of rank k (k < n). Take 
A , B e Mk.n (C) and define an equivalence relation by A ~ B if there exists 
g e GL(£, C) such that B = gA. We identify G*, n (C) with Mk. n (C)/GL(&, C). 
Let { A i , . . . , A/} be the collection of all the k x k minors of A e Mk. n (C). We 
define the chart U a to be a subset of Gk.n (Q such that det A a ^ 0. The k(n — k ) 
coordinates on U a are given by the non-trivial entries of the matrix A~ l A. See 
example 5.5 for details. 




Example 8.5. The common zeros of a set of homogeneous polynomials are a 
compact submanifold of CP' 1 called an algebraic variety. For example, let 
P(z°, ... ,z n ) be a homogeneous polynomial of degree d . If a ^ 0 is a complex 
number, P satisfies 


P(az°, az n ) — a d P(z°, ..., z n )• 

This shows that the zeros of P are defined on CP ' 1 ; if P (z° , . . . , z n ) = 0 then 
P([z°, . . . , z"]) = 0. For definiteness, consider 

P(z°, z\ z 2 ) = (z 0 ) 2 + (z 1 ) 2 + (z 2 ) 2 


and define N by 

N = {[z°, z 1 , z 2 ] e CP 2 |P(z°, z 1 , z 2 ) = 0}. (8.7) 

We define U ^ as in example 8.3. In N fl Uo, we have 

[?( 0)] 2 + [ f , 0)] 2 + 1 = 0 

where = z^/z 0 (note that z° i=- 0). Consider a holomorphic change of 
coordinates , £ 2 0) ) (v l = ^ ( o v »7 2 = [£ ( o)] 2 + [| ( 2 0) ] 2 + 1). Note that 
dip 1 , i] 2 )/d(^ 0y § ( 2 0) ) / Ounless£ 2 0) = z 2 = 0. Then Nr\U 0 r\U 2 = {(/?\ >; 2 ) e 
C 2 1 1 ] 2 = 0} is clearly a one-dimensional submanifold of C 2 . If £ 2 0) — z 2 = 0, we 
have (§( 0 ), | ( 2 0) ) (f 1 = [§ ( ' 0) ] 2 + [§ ( 2 0) ] 2 + 1, C 2 = | ( 2 0) ) for which the Jacobian 

does not vanish unless = z 1 =0. Then N fl Uq fl U\ — {(£ x , £ 2 ) e C 2 1 = 
0} is a one-dimensional submanifold of C 2 . On N fl Uq fl U\ fl U 2 . the coordinate 
change i-^ £ 2 is a multiplication by z 2 /z 1 and is, hence, holomorphic. In this 
way, we may define a one-dimensional compact submanifold N of CP 2 . 

A complex manifold is a differentiable manifold. For example, C" 1 is 
regarded as K 2 '" by the identification z ^ = x^ + iy^, x M , y ,x e ffi. Similarly, 
any chart U of a complex manifold has coordinates (z 1 , . . . , z m ) which may be 
understood as real coordinates (x 1 , y 1 , . . . , x m , y " 1 ) . The analytic property of the 
coordinate transformation functions ensures that they are differentiable when the 
manifold is regarded as a 2m -dimensional differentiable manifold. 

8.2 Calculus on complex manifolds 
8.2.1 Holomorphic maps 

Let f : M N, M and N being complex manifolds with dime M — m and 
dime N — n. Take a point p in a chart ( U , q>) of M. Let ( V , ir) be a chart of N 
such that f(p) e V. If we write {z^} = cp(p ) and {w v } = i k(f(p)), we have a 
map \[r o / o <p~ l : C' 1 -> C" . If each function ie v (l<v<n)isa holomorphic 



function of z 1 ', f is called a holomorphic map. This definition is independent 
of the special coordinates chosen. In fact, let (U\ tp') be another chart such that 
U IT U' ^ 0 and = x' x + \y' x be the coordinates. Take a point p e U fl U' . If 
w v = u v + iv v is a holomorphic function with respect to z, then 

du v _ du v dx 1 * du v dy>* _ dv v dy 1 * dv v dx >* _ dv v 

dx ,x dx 1 * dx ,x dy t* dy ,x dy 1 * dy ,x dx 1 * dy' x dy ,x 

We also find du v /dy ,x = — dv v /dx ,x . Thus, w v is holomorphic with respect to 
z! too. It can be shown that the holomorphic property is also independent of the 
choice of chart in N. 

Let M and N be complex manifolds. We say M is biholomorphic to 
N if there exists a diffeomorphism / : M — * N which is also holomorphic 
(then f~ l : N — > M is automatically holomorphic). The map / is called a 

biholomorphism. 

A holomorphic function is a holomorphic map / : M — »• C. There is 
a striking theorem; any holomorphic function on a compact complex manifold 
is constant. This is a generalization of the maximum principle of elementary 
complex analysis, see Wells (1980). The set of holomorphic functions on M 
is denoted by O(M). Similarly, O(U) is the set of holomorphic functions on 
U C M. 

8.2.2 Complexifications 

Let M be a differentiable manifold with dimp M — m. If / ; M — > C is 
decomposed as / = g + ih where g, h e T(M), then / is a complex-valued 
smooth function. The set of complex-valued smooth functions on M is called the 
complexification of 31. M), denoted by .T(;V/ ) . A complexified function does 
not satisfy the Cauchy-Riemann relation in general. For f — g + ih e 33 M ) “, 
the complex conjugate of / is / = g — ih. f is real if and only if / = /. 

Before we consider the complexification of T p M, we define the 
complexification V of a general vector space V with dimp : V — m. An element 
of V c takes the form X + i Y where X.Y e V. The vector space V ' becomes 
a complex vector space of complex dimension m if the addition and the scalar 
multiplication by a complex number a + ib are defined by 

(Xj + iF x ) + (X 2 + i Y 2 ) = (X! + X 2 ) + i(Ti + Y 2 ) 

(< a + ib)(X + iF) = (aX - bY ) + i(bX + aY) 

(P (P 

V is a vector subspace of V since X e V and X + iO e V may be identified. 
Vectors in V are said to be real. The complex conjugate of Z = X + iF is 
Z — X — iF. A vector Z is real if Z = Z. 

A linear operator A on V is extended to act on V ' as 


A(X + iF) = A(X) + iA(F). 


( 8 . 8 ) 



If A —*■ K is a linear function (A e V*), its extension is a complex-valued 
linear function on V c , A : V ' — >• C In general, any tensor defined on V 
and V* is extended so that it is defined on V and ( V* ) ' . An extended tensor is 
complexified as t = t\ + if 2 , where t\ and f 2 are tensors of the same type. The 
conjugate of t is 1 = t\ — if 2 . If t = 7, the tensor is said to be real. For example 
A : — > C is real if A(X + iY) = A{X - iT). 

Let {eic} be a basis of V . If the basis vectors are regarded as complex 
vectors, the same basis {ek} becomes a basis of V c . To see this, let X = X k ek, 
Y = Y k eic e V. Then Z = X + iT is uniquely expressed as (X k + i Y k )ek- We 
find dim® V — dime 

Now we are ready to complexify the tangent space T p M. If V is replaced by 
T p M, we have the complexification T p M c of T p M, whose element is expressed 
as Z = X + iT (X, Y e T p M). The vector Z acts on a function / = f\ + i /2 e 
3TM) C as 


Z[/] = X[/i+i/ 2 ] + iT[/i+i/ 2 ] 

= X[/i] - Y[f 2 \ + i {X[f 2 \ + Y [.fi]}. (8.9) 

The dual vector space T*M is complexified if a>, q e T*M are combined as 
f = co + i>;. The set of complexified dual vectors is denoted by ( T* M ) : . 
Any tensor t is extended so that it is defined on T p M c and (T* M)" and then 
complexified. 

Exercise 8.1. Show that (T^M)^ — ( T p M c )*. From now on, we denote the 
complexified dual vector space simply by 

Given smooth vector fields X,Ye X(M), we define a complex vector field 
Z = X + iT. Clearly Z\ p e T p M . The set of complex vector fields is the 
complexification of X(M) and is denoted by X(M ) ' . The conjugate vector field 
of Z = X + iT is Z = X - iT. Z = Z if Z e X(Af), hence X(M) C D X(M). 
The Lie bracket of Z = X + iT, W — U + iV e X(M) C is 

[X + i Y,U + iT] = {[X, U] - [Y, V]} + i{[X, V] + [Y, U]}. (8.10) 

The complexification of a tensor field of type (p, q) is defined in an obvious 
manner. If co. q e £2 l (M), £ = to + iq e (M) c is a complexified one-form. 

8.2.3 Almost complex structure 

Since a complex manifold is also a differentiable manifold, we may use the 
framework developed in chapter 5. We then put appropriate constraints on 
the results. Let us look at the tangent space of a complex manifold M with 
dime M = m. The tangent space T p M is spanned by 2m vectors 

a a _ a a | 

dx l ’ ’ dx m ay 1 ’ ’ 9 y m ] 


( 8 . 11 ) 


where z ** = x M + iy ^ are the coordinates of p in a chart ( U , (p). With the same 
coordinates, T*M is spanned by 


{dx 1 dx m ; dy 1 , . . . , dy'” j . 


( 8 . 12 ) 


Let us define 2m vectors 


3 _ i 3 .31 

dz 1 * 2 dx v dy 11 J 


3 

3F 


1 

2 


3 3 ] 

3“ i r 

dx v 3 J 


(8.13a) 

(8.13b) 


where 1 < pu < m. Clearly they form a basis of the 2m -dimensional (complex) 
vector space T p M c . Note that 3/3 z 1 * = d/dlf* . Correspondingly, 2m one-forms 


dz 1 * = dx 1 * + i dy^ d z/* = dx - i d y 1 * (8.14) 

form the basis of T*M C . They are dual to (8.13), 

<d z 1 *, 3/3 T) = {dz 1 *, d/d z v ) = 0 (8.15a) 

{dz 1 *, d/dz v ) = {dz 1 *, d/d z v ) = S>* v . (8.15b) 

Let M be a complex manifold and define a linear map J p : T p M — »• T p M 
by 

/ 3 \ _ 3 / 3 \ _ 3 

P \dx^) dy 11 P \dyt* ) dx 1 * 

J p is a real tensor of type (1 , 1). Note that 

J p — —id t p m- (8.17) 

Roughly speaking, J p corresponds to the multiplication by ±i. The action of J p 
is independent of the chart. In fact, let ( U , <p) and ( V, \ Jr) be overlapping charts 
with cp(p ) = z 1 * — x 1 * + iy 1 * and i {r(p) = w 1 * — u 1 * + iv 1 *. On U (T V, the 
functions z/‘ — z/‘ (w) satisfy the Cauchy-Riemann relations. Then we find 

/ 3 /3x v 3 dy v 3 \ _ dy v d dx v 3 _ 3 

p \3 ) p \3 dx v 3 3y y / 3i>^ 3y v dv^ 3x v dv^ 

We also find that J p d/dv ** = —3/3 u 1 * . Accordingly, J p takes the form 


Jp — 



(8.18) 


with respect to the basis (8.11), where /,„ is the m x m unit matrix. Since all 
the components of J p are constant at any point, we may define a smooth tensor 
field J whose components at p are (8.18). The tensor field J is called the almost 



complex structure of a complex manifold M. Note that any 2m -dimensional 
manifold locally admits a tensor field J which squares to — h m ■ However, J may 
be patched across charts and defined globally only on a complex manifold. The 
tensor J completely specifies the complex structure. 

The almost complex structure J p is extended so that it may be defined on 
T p M c , 

J p (X + iZ) = J p X + i J p Y. (8.19) 

It follows from (8.16) that 


Jpd/dz^ = i9/9z M Jpd/dz* = -id/dz». 
Thus, we have an expression for J p in (anti-)holomorphic bases, 


Jp — i dz^ <g> 


9 

dz ^ 


id 


9 

8V 1 


whose components are given by 


Jp — 


i Im 0 
0 -i In 


( 8 . 20 ) 


( 8 . 21 ) 


( 8 . 22 ) 


Let Z e T p M ' be a vector of the form Z = Z / ' 9/9~ / ' . Then Z is an eigenvector 
of J p \ J p Z = iZ. Similarly, Z = Z , '9/9’ // satisfies J p Z — — iZ. In this way 
TpM"'- of a complex manifold is separated into two disjoint vector spaces, 


T p M C = T p M + © T p M~ (8.23) 

where 

T p M ± = {Z e T p M c \J p Z = ±iZ}. (8.24) 

We define the projection operators V ± : T p M c — »■ T p M ± by 

V ± = jihmTiJp)- (8.25) 

In fact, J p V ± Z = ^( J p © iJp)Z = ±i V ± Z for any Z e TpM' 1 -' . Hence, 

Z ± = V ± Z e T p M ± . (8.26) 

Now Z e T p M c is uniquely decomposed as Z = Z + + Z~ (Z ± e T p M ± ). 
T p M + is spanned by {9/9z M } and T p M~ by {9/9z M }. Z e T p M + is called a 

holomorphic vector while Z e T p M~ is called an anti-holomorphic vector. 

We readily verify that 

T p M~ = T P M+ = {Z|Z e T p M + }. (8.27) 


Note that 


dime T p M + — dime T p M = \ dime T p M c — ^ dime M. 



Exercise 8.2. Let (U . cp) and (V, \[r) be overlapping charts on a complex manifold 
M and let z M = (pip ) and w ^ = \ jr{p). Verify that X = X^d/dz^, expressed 
in the coordinates w 1 ', contains a holomorphic basis {(l/th/T [ only. Thus, the 
separation of T p M into T p M ± is independent of charts (note that J is defined 
independently of charts). 

Given a complexified vector field Z e X(M) C , we obtain a new vector field 
JZ e X(M ) c defined at each point of M by JZ\ p — J p ■ Z\ p . The vector field 
Z is naturally separated as 

Z = Z+ + Z“ Z ± = V ± z (8.28) 

where Z± — T >± Z. The vector field Z + (Z~) is called a holomorphic (anti- 
holomorphic) vector field. Accordingly, once J is given, X(M) C is decomposed 
uniquely as 

X(M) C = X(M) + © X(M)“. (8.29) 

Z = Z+ + Z~ e X(M) C is real if and only if Z + = Z~ . 

Exercise 8.3. Let X. Y e X(M) + . Show that [X, Y] e X(M) + . [If X. Y e 
X(M)~, then [X. Y] e X(M)~ .] 

8.3 Complex differential forms 

On a complex manifold, we define complex differential forms by which we will 
discuss such topological properties as cohomology groups. 

8.3.1 Complexification of real differential forms 

Let M be a differentiable manifold with dim;:; M = m . Take two q -forms 
w, i] e at p and define a complex <7 -form p — a> +uj. We denote the 

vector space of complex g-forms at p by f2|(M) c . Clearly Q q p (M) C S2|(M) C . 
The conjugate of f is t, — 00 — irj. A complex g-form t, is real if t, — t,. 

Exercise 8.4. Let co e Q, q p (M) c . Show that 


cu(Vi V q ) = wiV i,...,V q ) Vi e T p M c . (8.30) 

Show also that co + q — co + rj, Xco — Xco and co — co, where co, rj e Q, q p iM) c and 
AeC 

A complex q -form a defined on a differentiable manifold M is a smooth 
assignment of an element of Q. q p (M) c . The set of complex q -forms is denoted by 
Q. q (M) c . A complex q - form £ is uniquely decomposed as £ = co + i/ 7 , where 
co, t] e Q q (M). 



The exterior product of £ = co + ip and § = cp + ix[r is defined by 

£ A £ = (co + irj) A (cp + ifi) 

= (co A cp — p A fi) + i(&) A^ + l)Af)). (8.31) 

The exterior derivative d acts on £ = + ip as 

d£ = d&> + id/?. (8.32) 

d is a real operator: df = den — i dp = df . 

Exercise 8.5. Let co e £2 q (M) c and £ e Q, r (M) ~. Show that 

0)A^ = (-lf?A(B (8.33) 

d(wA^) = den A I + (-l) 9 ©Ad?. (8.34) 

8.3.2 Differential forms on complex manifolds 

Now we restrict ourselves to complex manifolds in which we have the 
decompositions T p M c = T p M + © T p M~ and X(M) C = X(M) + © X(M)~ . 

Definition 8.2. Let M be a complex manifold with dim = /77 . Let co e 
Q q p (My~ ( q < 2m) and r , s be positive integers such that r + s = q. Let Vj- e 
T p M c (1 < i < q) be vectors in either T p M + or T p M~ . If <w(Vi, . . . , V^) = 0 
unless r of the V,- are in T p M + and .v of the V, are in T p M~, co is said to be of 
bidegree (r. s) or simply an (r, ,y)-form. The set of (r. ,v)-forms at p is denoted by 
Q. r p s (M). If an (r. ,y)-form is assigned smoothly at each point of M, we have an 
(r, s ) -form defined over M. The set of (r, .v) -forms over M is denoted by Q r s (M ). 

Take a chart (U, cp) with the complex coordinates (p{p) = z 'L We take the 
bases (8.13) for the tangent spaces T p M ± . The dual bases are given by (8.14). 
Note that dz^ is of bidegree (1, 0) since (d;;^, d/dz v ) = 0 and d Tfi is of bidegree 
(0, 1 ). With these bases, a form co of bidegree (r, s) is written as 

co — -j—(Om... l i, r v 1 ...v s dz^ 1 A ... A dz^ A dZ 1 ’ 1 A ... A dZ y * . (8.35) 

r\ s ! 

The set {dz Ml A ... A dz Mr A dZ y ’ A ... A dZ yj } is the basis of £2 r p s (M). The 
components are totally anti- symmetric in the // and v separately. Let z 1 ' and w 1 ' 
be two overlapping coordinates. The reader should verify that an (r, ,v)-form in 
the zj' coordinate system is also an (r. ,v)-form in the w v system. 

Proposition 8.1. Let M be a complex manifold of dime Af = m and o> and § be 
complex differential forms on M. 


(a) If co e Q. q ’ r (M) then co e Q. r ' q (M). 

(b) If w e £2 q,r (M) and § e £2 q ' y (M), then co A $ e Q q+q '’ r+r ' (M). 



(c) A complex q - form co is uniquely written as 

o) — X w (r ’ s ^ (8.36a) 

r+s=q 

where e Q r - S (M). Thus, we have the decomposition 

Q. q {M) c = 0 £l r,s (M). (8.36b) 

r+s=q 


The proof is easy and is left to the reader. Now any g-form &> is decomposed 
as 


a= x oj(r ’ s) 

r+s=q 

= X -Y- } 0 ) fll ... flr v 1 ...v s dz fl1 A... Adz^ Ad ?' 1 A... AdI Vl 
r+s=q r S - 

(8.37) 


where 


Vll\...IA r V\...Vs 


a 9 


’ ’ 9 z^r ’ g z y i ’ ’ g z v * 

Exercise 8.6. Let dim R M — m. Verify that 


(8.38) 


dimjj Q.'p S (M) 


it'll < r, s < m 


otherwise. 


Show also that dim R Sl q p {M) c = J2 r + S =q dim K &p {M) = ( 2 ™). 


8.3.3 Dolbeault operators 

Let us compute the exterior derivative of an (r, ,vj-fomi a>. From (8.35), we find 
1/9 a 9 -A 

d<w ~~ I dz + ^=A W W-MrV 1 ...v s dz 

x dz M1 A ... A d z^ r A dz 1 ' 1 A ... A dz Vs . (8.39) 

d a) is a mixture of an (r + 1, .y )-form and an (r, s + l)-form. We separate the 
action of d according to its destinations. 


d= 9 + 9 


(8.40) 



where 9 : £T'- S (M) -» £2 r+l ' s (M) and 9 : Q rs (M) -> £i r ’ s+l (M). For example, 
if m — A dZ 1 ’, its exterior derivatives are 

9m = —y-d z k A d z^ A dZ v 

9z A 

9m = y-dZ A A dz M A dZ y = y ~<\ A dZ A A dZ y . 

9Z X dz x 

The operators 9 and 9 are called the Dolbeault operators. 

If a) is a general q - form given by (8.37), the actions of 9 and 9 on m are 
defined by 

9m = ^ 9ft/ r ' s) 9m = ^ 9m ( ' vs) . (8.41) 

Theorem 8.1. Let M be a complex manifold and let m e Q r/ (M ) and if e 
QP(M) C . Then 


99m = (99 + 99)m = 99m = 0 

(8.42a) 

9m = 9m, 9m = 9m 

(8.42b) 

9(m A §) = 9m A £ + (— l) 9 m A 9£ 

(8.42c) 

9(m A §) = 9m A § + (— l) 9 m A 9£. 

(8.42d) 


Proof. It is sufficient to prove them when o> is of bidegree (r. 5 ). 

(a) Since d = 9 + 9, we have 

0 = d“m = (9 + 9) (9 + 9)ft; = 99ft> + (99 + 99)ft) + 9 9ft). 

The three terms of the RHS are of bidegrees (r + 2, ,v), (r + I . ,v + 1) 
and (r, ,v + 2) respectively. From proposition 8.1(c), each term must vanish 
separately. 

(b) Since dm = dm, we have 

9m + 9m = dm = (9 + 9)m = 9m + 9m. 

Noting that 9m and 9m are of bidegree (j + 1, r) and 9m and 9m are of 
( 5 , r + 1), we conclude that 9m = 9m and 9m = 9m. 

(c) We assume m is of bidegree (r, s) and £ of (r', s')- Equation (8.42c) is 

proved by separating d (m A f ) = dm A f + (— l) 9 m A df , into forms of 
bidegrees (r + r' + 1, s + s') and (r + r' , s + s' + 1). □ 

Definition 8.3. Let M be a complex manifold. If m e f2 r,0 (M) satisifies 9m = 0, 
the r-form m is called a holomorphic r-form. 



Let us look at a holomorphic 0-form f e 3~(U) on a chart (U, <p). The 
condition 3/ = 0 becomes 


— 7 - = 0 1 < X < m — dim -- M. 

dz x ~ ~ 


(8.43) 


A holomorphic 0-form is just a holomorphic function, / e 'J(Uy 
£2 r,0 (M), where 1 < r < m — dime AL On a chart ( U , <p), we have 


Let co e 


1 


00 = —co 
r\ 


Ml --Mr 


d Z m A ... A dz Mr . 


(8.44) 


Then dco = 0 if and only if 

3 

— -Mr — 0 

oz 

namely if co are holomorphic functions on U. 

Let dime A/ — in. The sequence of C-linear maps 


Q r '°(M) ■■■ 

a -> Q r ’ m - ] (M) Q r ' m (M) (8.45) 

is called the Dolbeault complex. Note that 3" = 0. The set of 3-closed (r, s)- 
forms (those co e Q l s (M) such that dco = 0) is called the (r, v)-cocvcle and is 
denoted by Z'- s (M). The set of 3-exact (r, i)-forms (those co e Q, r ’ s (M) such 
that co — dq for some q e (M)) is called the (r, ,v)-coboundarv and is 

denoted by ii’- s (M). The complex vector space 

Hj S {M) = Zj S (M)/ Bj S (M) (8.46) 

is called the (r, s )th 3-cohomology group, see section 8 . 6 . 


8.4 Hermitian manifolds and Hermitian differential geometry 

Let M be a complex manifold with dime Af — m and let g be a Riemannian 
metric of M as a differentiable manifold. Take Z = X+iY, W — U + i V e T p M 


and extend g so that 

8 P (Z, W) = g p (X, U) - g p (Y, V) + i [g p (X, V) + g p (Y, U)]. (8.47) 

The components of g with respect to the bases (8.13) are 

g l *v(j>) = g P (d/dz lt ,d/dz v ) (8.48a) 

8Mp) = SpO/dz^d/dr) (8.48b) 

gMP) = 8 P (9/dz tl ,d/dz v ) (8.48c) 

gjLv(p) — gpid/dz 11 , d/dz, v )- (8.48d) 



We easily verify that 


g/XV — gvfX, gp.V — gvp . ! gfXV — gvp.1 gfJLV — gfXV > g^LV — g /XV ■ 

(8.49) 

8.4.1 The Hermitian metric 

If a Riemannian metric g of a complex manifold M satisfies 

gpUpX, J p Y ) = g p (X, Y) (8.50) 

at each point p e M and for any X . Y e T p M. g is said to be a Hermitian metric. 
The pair (M, g) is called a Hermitian manifold. The vector J p X is orthogonal 
to X with respect to a Hermitian metric, 

gpiJpX, X) = gp(JpX, JpX ) = - gp(JpX , X) = 0. (8.51) 

Theorem 8.2. A complex manifold always admits a Hermitian metric. 


Proof. Let g be any Riemannian metric of a complex manifold M. Define a new 
metric g by 

g p (X, Y) = \\g p {X, Y) + g p (JpX, JpY)]. (8.52) 

Clearly g p (J p X, J p Y ) = gp(X, Y). Moreover, g is positive definite provided 
that g is. Hence, g is a Hermitian metric on M. □ 

Let g be a Hermitian metric on a complex manifold M. From (8.50), we find 

that 

/ 9 3 \ 

8 yJzi 1 ' 7 3 zV “ ~ 8 

hence g pv — 0. We also find that gj^j = 0. Thus, the Hermitian metric g takes 
the form 

g = giivdz 11 ® dz y + gjivdl. 11 ® di y . (8.53) 

[Remark: Take X.Y <= T p M + . Define an inner product h p in T p M + by 

h p (X, Y) = g p (X, Y). (8.54) 

It is easy to see that h p is a positive-definite Hermitian form in T p M + . In fact, 




gpv — g 


3 3 

3 ’ 3 z v 


h(X, Y) = g(X, Y) = g(X, Y) = h(Y. X) 

and h(X, X) = g(X,X) = g(Xi, Xi) + g(X 2 , X 2 ) > 0 for X = X x +iX 2 . This 
is why a metric g satisfying (8.50) is called Hermitian.] 



8.4.2 Kahler form 


Let (M, g) be a Hermitian manifold. Define a tensor field Q whose action on 
XJe T P M is 


n p (X,Y) = gp(J p X,Y) X,Y e T p M. (8.55) 

Note that £2 is anti-symmetric, £2(Z, Y) = g(JX,Y) — g(J 2 X, JY) — 
—g(JY, X) = —Q(Y, X). Hence, Q. defines a two-form called the Kahler form 
of a Hermitian metric g. Observe that O is invariant under the action of J. 

£2(JX, JY) = g(J 2 X, JY) = g(J 3 X, J 2 Y) = £2(X, F). (8.56) 

If the domain is extended from T p M to TpM ' 2 , Q is a two-form of bidegree 
(1, 1). Indeed, for the metric (8.53), it is found that 


/ 9 9 \ / 9 9 \ 

\9z^ ’ 9z v / \ 9z M ’9z v / 


= i g^v = °- 


We also have 




a a \ 

d¥’ d?) 


= o, 


(-L ,±) 

{dz^ dz v J 


— i §nv — ^ 


9 9 \ 

dz v ’ 9z^ / 


Thus, the components of £2 are 


Qpv — £2//y 0 


fJ,V 




Vfj, — Iguv 


(8.57) 


We may write 

O = ig pV dz^ ® dz y - \gvn dz y ® d z^ 1 = i gpy d z' 4 A dz y . (8.58) 

£2 is also written as 

£2 = - y /xTJ dz M a dz y (8.59) 

where J pV = g^J X v = -i gfiv- ^ is a real form; 

£2 = -ig^dz^ A dz y = igvp dz y A dz^ 1 = £2. (8.60) 

Making use of the Kahler form, we show that any Hermitian manifold, and 
hence any complex manifold, is orientable. We first note that we may choose an 
orthonormal basis {ei, Je\. . . . , e m , Je m ). In fact, if g(e i, e\ ) = 1, it follows 
that g(Je i, Je\) — g(e i, ei) = 1 and g(e i, Je\) = —g(Je i, e\) = 0. Thus e\ 
and Je\ form an orthonormal basis of a two-dimensional subspace. Now take ej 
which is orthonormal to e\ and Je\ and form the subspace [ej- Je 2 }. Repeating 
this procedure we obtain an orthonormal basis {ei , J e\ , ... , e m . Je m }. 


Lemma 8.1. Let £2 be the Kahler form of a Hermitian manifold with dim-; M — 
m. Then 

A , . , A 
m 


is a nowhere vanishing 2m -form. 



Proof. For the previous orthonormal basis, we have 


£2(e;, Jef) — g(Jej, Jef = Sq Q(ei,ef = £2(7e;, Jef — 0. 


Then it follows that 

ft A ... A £2 (ei , J e \ , . . . , e m , Je m ) 
m 

= ^2 &{ep( i), Tep(i)) . . . Q(ep( m ), Jep( m) ) 
p 

= m!£2(e i, /ei) . . . £2(e m , Je m ) = /«! 

where P is an element of the permutation group of m objects. This shows that 
£2 A ... A £2 cannot vanish at any point. □ 

Since the real 2m -form £2 A ... A £2 vanishes nowhere, it serves as a volume 
element. Thus, we obtain the following theorem. 

Theorem 8.3. A complex manifold is orientable. 


8.4.3 Covariant derivatives 

Let (M, g) be a Hermitian manifold. We define a connection which is compatible 
with the complex structure. It is natural to assume that a holomorphic vector 
V e T p M + parallel transported to another point q is, again, a holomorphic vector 
V (q) e T q M + . We show later that the almost complex structure is covariantly 
conserved under this requirement. Let {z 11 } and { z ^ + Az M } be the coordinates of 
p and q, respectively, and let V — V^d/dz^lp and V (q) — V^(z + Az)d/dz fl \q. 
We assume that (cf (7.9)) 

V^z + A Z ) = V^iz) - y A (z)r' i y x(z)Az y . ( 8 . 61 ) 


Then the basis vectors satisfy (cf (7.14)) 

9 

' dz v 

Since d/dz ^ is a conjugate vector field of 9/9z /i , we have 


— r fiviz) „ j • 


9 

’ dz x 


9 


9 


v — r A 

V ^dz v ~ 


(8.62a) 


(8.62b) 


where r A p;y = r A |ttv . F A M y and F A p;y are the only non-vanishing components of 
the connection coefficients. Note that V /i 9/9z v = Vjj-9/9z y = 0. For the dual 
basis, non-vanishing covariant derivatives are 

Vp dz v = -r V dz X Vjrdz v = -T** z\ 


(8.63) 



(8.64) 


The covariant derivative of X + = X^d/dz* 1 e X(M) + is 

0 

v,i + = (a,i l + rrV)^ 

where 3^ = d/dz^ ■ For X~ — X^d/dz 1 * e X(M)~ , we have 

V„X~ = d^X 1 ^ (8.65) 

az 

since T^y = F^y = 0. As far as anti-holomorphic vectors are concerned, 
works as the ordinary derivative 3^ . Similarly, we have 

V^X + = ^X x ^~ (8.66) 

o 

V F *“ = @izX x + X v r X j[ V )-^. (8.67) 

It is easy to generalize this to an arbitrary tensor field. For example, if t — 

tnJdz 11 ® dx v <g> 3/3 z X , we have 

(^/cO/XV — ?£y T^ Kfl tji^ r^y 

— 9f/u,v T" I/iv^r — 

We require the metric compatibility as in section 7.2. We demand that 
V K gfj,v = ^Tgixv = 0. In components, we have 

3 K g m - gxvT k Kll = 0 3 rguv - g^Tzjz = 0 . ( 8 . 68 ) 

The connection coefficients are easily read off: 

rV = g vx d KgllV r x F y = (8.69) 

where {g vA } is the inverse matrix of g g^g kv = 8/, g vX gXji = & v jr- A metric- 
compatible connection for which T (mixed indices) = 0 is called the Hermitian 
connection. By construction, this is unique and given by (8.69). 

Theorem 8.4. The almost complex structure J is covariantly constant with respect 
to the Hermitian connection, 


(V*/) ^ = (V F /)y^ = (V,/y* = (V F /V‘ = 0. (8.70) 


Proof. We prove the first equality. From (8.22), we find 

(v,./)y' i = 3*i5y^ - = o. 


Other equalities follow from similar calculations. 


□ 



8.4.4 Torsion and curvature 


The torsion tensor T and the Riemann curvature tensor R are defined by 


We find that 


T(X, Y) = V X Y - V Y X - [X, Y ] 

R(X, Y)Z = V x VyZ - V F V X Z - V [X ,F]Z. 


3 3 

QyfJ, ’ 

3 3 

3 ’ 3z y 
3 3 

3F’ 3T 


r( — . — ) =(rV-r\ #i )g ? 


3 3 

3z M ’ 3z v , 


= T 


— — \ 

— t 1 MW 1 VfJ.) x - 

dz 


The non-vanishing components are 


T k (IV — r^/rv r^u/i — g^(B(lg v j: dvg/jjj) 

T jzv =r jxv r — g ^ (9jrguf — <h gjit;) ■ 


As for the Riemann tensor, we find, for example, that 


(8.71) 

(8.72) 


(8.73a) 

(8.73b) 


R k 


k/iv 


= 3 m r k 


vk ' 


3„rv + r 


vk A fir] 


H„xr 


/xA A vrj • 


If (8.69) is substituted, we find that 


tfV = ^g lK dvg xl + g&BvBvgxf - dvg^d^s - g lK d„d v g x f 

+ g^Bvg^g^Bpg^ - g^Bug^g^Bvg^ = 0 


where use has been made of the identity g^B^g — —g^jd^g^ etc. In general, 
we find that 

rK xab = r *iab = R a BkX = R A B1 a = 0 (8.74) 

where A and B are any (holomorphic or anti-holomorphic) indices. As a result, 
we are left only with the components R K X-JZv, R K X„v, RK XjZv and RK X„v- Note that 
we have a trivial symmetry R K xjiv = — RK Xvjc- So the independent components 
are reduced to R K xjiv and RK XlxV — RK xjcv- We find that 

RK XJZv = 3jj-r%A = Bjzig^^dvg^ j) 

R %v = ^vX = W^g^). 


(8.75a) 

(8.75b) 



Exercise 8. 7. Show that 


R kX/av — 8 k% X/xv — &]ld v gXic g ^ &jZ8ic%dv8X.ri ( 8 . 76 a) 

R Kk& - 8 k$ rI Ihv = d ^gkK - g^^g^glrj ( 8 - 76b ) 

RkX/xv — gic^ R ^ X/xv — RkX~ii ( 8 . 76 c) 

R <xjzv =S k J rH xtcv = ~ R kIvTl- ( 8 . 76 d) 

Verify the symmetries 

R kXJIv = —RxkJZv R kX/j,v — — R Ikij.v- ( 8 . 77 ) 

Let us contract the indices of the Riemann tensor as 

= R k KI iv = = - 9 v 3 ^ log G ( 8 . 78 ) 

where G = det(y /( jr) = v /g. To obtain the last equality, we used an identity 
SG = Gg^Sg^v'y see ( 7 . 204 ). We define the Ricci form by 

91 = d z* A dz 1 ’ = i 33 log G. ( 8 . 79 ) 


9t is a real form; 91 = — i331ogG = — i331ogG = 91. From the identity 
33 = — j d (3 — 3), we find 91 is closed; d9t oc d 2 (3 — 3) logG = 0. However, 
this does not imply that 91 is exact. In fact, G is not a scalar and (3 — 3) log G is not 
defined globally. 91 defines a non-trivial element ci(M) = [91/27T] £ H 2 (M\ M) 
called the first Chern class. We discuss this further in section 1 1.2. 

Proposition 8.2. The first Chern class c\{M) is invariant under a smooth change 
of the metric g -> g + Sg. 


Proof. It follows from (7.204) that 3 log G = g llv 8 g^. Then 

391 = 3i33 log G = u)dg^8g llV = -±d (3 - d)ig^8 gflV . 

Since g^Sg^y is a scalar, &> = —^(3 — d)g llv 8g l xv is a well-defined one-form on 
M. Thus, 391 = d co is an exact two-form and [91] = [91 + 391], namely c\{M) is 
left invariant under g -» g + 8g. □ 

8.5 Kahler manifolds and Kahler differential geometry 
8.5.1 Definitions 

Definition 8.4. A Kahler manifold is a Hermitian manifold (M, g) whose Kahler 
form is closed: df2 = 0. The metric g is called the Kahler metric of M. 
[ Warning : Not all complex manifolds admit Kahler metrics.] 



Theorem 8.5. A Hermitian manifold (M, g) is a Kahler manifold if and only if 
the almost complex structure J satisfies 

V /X i = 0 (8.80) 

where V /x is the Levi-Civita connection associated with g. 


Proof. We first note that for any /--form a>, d co is written as 

d co = Va) = —V^cov dx^ A d.r yi A ... A d.r lv . (8.81) 

[For example, 

VQ, — j V^£2 |Ul ,d.i: x A d.r M A d.r 1 ’ 

= - r' c A. v ^ lttK )ck x Adi f ‘ A(k v 

= j3x^ / , ty d.r x A dx M A d.r y = d£2 

since T is symmetric.] Now we prove that Vj, ./ — 0 if and only if V /x Q = 0. We 
verify the following equalities: 

(V Z ^)(X, Y) = V Z [C2(X, Y)] - Q(X Z X, Y ) - S2(X, V Z Y) 

= V z [g(JX, Y )] - g(JV z X, Y) - g(JX, V Z F) 

= (X z g)(JX, Y) + g{V z JX , Y) - g(JV z X , Y) 

= g(VzJX - JVzX, Y) = g((y z J)X, Y) 

where V z g = 0 has been used. Since this is true for any X, Y. Z, it follows that 
V z £2 = 0 if and only if V z / = 0. □ 

Theorems 8.4 and 8.5 show that the Riemann structure is compatible with 
the Hermitian structure in the Kahler manifold. 

Let g be a Kahler metric. Since dff = 0, we have 

O+aiigpdz^Adf 

= '^xg^v dz x A d Z 11 A dz 1 ’ + i3 jg^v dz x A d z M A dz v 
= jitfiguv - 3 ngxu) dz A A dz^ A dz 1 ’ 

+ 71 ( 3 xgp-v - 3f^ m i) dz A A dz" A dz v = 0 

from which we find 

dg/j.v hg)— hg/jjj dgpx 

dz x dz M dz x 3z y 


(8.82) 



Suppose that a Hermitian metric g is given on a chart (7; by 


gfiv — d^dvIC, 


(8.83) 


where /C, e 3 '(£/;). Clearly this metric satisfies the condition (8.82), hence it is 
Kahler. Conversely, it can be shown that any Kahler metric is locally expressed 
as (8.83). The function /C, is called the Kahler potential of a Kahler metric. It 
follows that £2 = iddICi on Uj. 

Let (Ui, q>i) and ( Uj , (fij) be overlapping charts. On Uj IT Uj, we have 


9 9 

3z ;t 9z l 


K'-i dz^ 1 <g> dz y 


9 9 

dw a dm? 


JC/ d w a <8> du>^ 


where z = <Pi(p ) and w — <Pj(p ). It then follows that 


9 w a dujP d 9 3 3,. 

a „ v a „ (8-84) 

dz p oz aw 01 dw p oz p az 

This is satisfied if and only if ICj(w,w ) = /C, (z, z) + <pij (z) + V' 7 / (z) where </>,-/ 
(\j/jj ) is holomorphic (anti-holomorphic) in z. 


Exercise 8.8. Let M be a compact Kahler manifold without a boundary. Show 
that 

Q m = Q A . . A Q 
m 

is closed but not exact where m = dime M [Hint. Use Stokes’ theorem.] Thus, 
the 2mth Betti number cannot vanish, b 2m > 1 . We will see later that b 2p > 1 for 
1 < p < m. 


Example 8.6. Let M — O" = {(z 1 , . . . , z m )}. C” is identified with M 2m by the 
identification z p — > x p + i y p . Let 8 be the Euclidean metric of K 2 "' , 


\ dx 11 dx v ) 

(-■-) 

\ dx IJ - dy v ) 


\dy t 1 dy v J 


= 8 


/IV 


(8.85a) 


Noting that Jd/dx p — d/dy p and Jd/dy p — —d/dx p , we find that 8 is a 
Hermitian metric. In complex coordinates, we have 


8 

8 


9 9 

dz p ’ 3 z v 

9 9 

dz p ’ 3z y 


= 9 


= 8 


9 9 

SfF' 3 F 
9 9 

3 z^ ’ 3z y 



(8.85b) 



The Kahler form is given by 


. Ill . Hi 

Q = d z 11 A dz^ = X - dx^ A dv^. (8.86) 

fl= 1 11= 1 

Clearly, dC = 0 and we find that the Euclidean metric S of M 2 "' is a Kahler metric 
of C™ . The Kahler potential is 


/c = (8.87) 

The Kahler manifold C” is called the complex Euclid space. 

Example 8. 7. Any orientable complex manifold M with dimy M = 1 is Kahler. 
Take a Hermitian metric g whose Kahler form is Q. Since C is a real two-form, a 
three-form df2 has to vanish on M. One-dimensional compact orientable complex 
manifolds are known as Riemann surfaces. 


Example 8.8. A complex projective space <CP m is a Kahler manifold. Let 
(U a , <p a ) be a chart whose inhomogeneous coordinates are (fi a (p) = £7,, n/a 
(see example 8.3). It is convenient to introduce a tidier notation {C y (a)|l < v < 
m } by 

*V) = ?”(«) (V < a - 1) r + V> = CV) (v>a). (8.88) 

{£ v ( a )} is just a renaming of {f v (a)}. Define a positive-definite function 


m+1 


/c a (/>) = £icV)Cp)i 2 + i = E 


V=1 V=1 

At a point p e U a C\Up, JC a ip) and lCp(p) are related as 

s |2 


(8.89) 


JCa(p) — 


1C p{p). 


(8.90) 


Then it follows that 


log ICo/ — log Kip T log T log . (8.91) 

Since z? /z a is a holomorphic function, we have 3 log z^ /z a = 0. Also 


3 log z^/z 01 = 3 log z& /z a = 0. 


Then it follows that 


33 log IC a — 33 log ICp. 


(8.92) 



A closed two-form £2 is locally defined by 


£2 = i33 log /C a . 


(8.93) 


There exists a Hermitian metric whose Kahler form is £2. Take X, Y e 
T p CP n and define g : T p CP n <g> T p CP n -» ffi by g(X, Y) = Q(X, JY). To 
prove that g is a Hermitian metric, we have to show that g satisfies (8.50) and is 
positive definite. The Hermiticity is obvious since g(JX , JY) = —Q(J X . Y) — 
£2(T, JX) — g(X , Y). Next, we show that g is positive definite. On a chart 
(Uc, (Pa), we obtain 


£2 = i 


3 2 log K, 

awr 


d^ L A di) l ’ 


(8.94) 


where we have dropped the subscript (a) to simplify the notation. If we substitute 
the expression (8.89) for 1C on U a , we have 


£2 = 


WEirf + iw'*? 


fl,V 


(El? 


X\2 . 


I) 2 


dO M A do 


(8.95) 


Let X be a real vector, X = X^d/d^ + X^d/d^ and JX = iX^d/d^ - 
iT' 4 3/30 M . Then 


g(X, X) = £2 (X, JX) = 2 £ Sj ^M 


fl, V 


(£I?¥ + D 2 


- a ^ i 


I0 A | 2 +! 


(^|o^| 2 + ! 


-2 


From the Schwarz inequality |Z /X | 2 • X! I^l 2 - X! I^^l 2 ’ we find the 

M A 

metric g is positive definite. This metric is called the Fubini-Study metric of 
CP" . 


A few useful facts are: 

(a) S 2 is the only sphere which admits a complex structure. Since S 2 — CP 1 , it 
is a Kahler manifold. 

(b) A product of two odd-dimensional spheres .S 2 "' +l x .S' 2,1+1 always admits a 
complex structure. This complex structure does not admit a Kahler metric. 

(c) Any complex submanifold of a Kahler manifold is Kahler. 


8.5.2 Kahler geometry 

A Kahler metric g is characterized by (8.82): 

dgfiv 9,?Ai7 9g//v ^8 p x 

dz x dz M dz X dz V 



This ensures that the Kahler metric is torsion free-. 

T\ v = / x (9^ vf - d v g IA p = 0 (8.96a) 

T* w = g ^ Gfigvt - dvgm) = 0. (8.96b) 

In this sense, the Kahler metric defines a connection which is very similar to the 
Levi-Civita connection. Now the Riemann tensor has an extra symmetry 

R K ',,,v = -dvig^d^j) = -Mg lK dig^) = R\ xv (8.97) 

as well as those obtained from (8.97) by known symmetry operations, 

= ^v = ^, R%v= R %x- (8.98) 

The Ricci form 91 is defined as before, 

9t = -i^logGdz'* Adz". 

Because of (8.97), the components of the Ricci form agree with Ric^', 91,,- = 
R K [IV — R K ILK V — Ric /IV . If Ric = y\= 0, the Kahler metric is said to be Ricci 
flat. 

Theorem 8.6. Let (M, g) be a Kahler manifold. If M admits a Ricci flat metric h, 
then its first Chern class must vanish. 


Proof. By assumption, 91 = 0 for the metric h. As was shown in the previous 
section, (R(g) — 91(/0 = 91 (g) — dco. Hence, c\(M) computed from g agrees 
with that computed from h and hence vanishes. □ 

A compact Kahler manifold with vanishing first Chern class is called a 
Calabi-Yau manifold. Calabi (1957) conjectured that if ci(M) = 0, the Kahler 
manifold M admits a Ricci-flat metric. This is proved by Yau (1977). Calabi-Yau 
manifolds with dime M — 3 have been proposed as candidates for superstring 
compactification (see Horowitz (1986) and Candelas (1988)). 

8.5.3 The holonomy group of Kahler manifolds 

Before we close this section, we briefly look at the holonomy groups of Kahler 
manifolds. Let (M.g) be a Hermitian manifold with dim'; M = in. Take a 
vector X e T p M + and parallel transport it along a loop c at p. Then we end up 
with a vector X' e T p M + where X ^ = X^hv 11 . Note that V does not mix the 
holomorphic indices with anti-holomorphic indices, hence X' has no components 
in T p M~ . Moreover, V preserves the length of a vector. These facts tell us that 
( h p v (c )) is contained in U (m) C 0(2 m). 

Theorem 8.7. If g is the Ricci-flat metric of an m -dimensional Calabi-Yau 
maifold M, the holonomy group is contained in SU(/«). 




p 


Figure 8.5. X e T p M + is parallel transported along pqrs and comes back as a vector 
X' e T p M+. 

Proof. Our proof is sketchy. If X — X^'d/dz 11 e T p M + is parallel transported 
along the small parallelogram in figure 8.5 back to p, we have X' e T p M + whose 
components are (cf (7.44)) 

X' 1 = X>‘ + X v R'\ Kl e K t (8.99) 

from which we find 

V = S/ + R v llxI s K t. (8.100) 

U (m) is decomposed as U(/«) = SU(m) x U(1 ) in the vicinity of the unit element. 
In particular, the Lie algebra u(m) = T e (U(m)) is separated into 

u(m) = su(m) © u(l). (8.101) 

su (in) is the traceless part of u (m) while u(l) contains the trace. Since the present 
metric is Ricci flat, the u(l) part vanishes, 

= 0 . 

This shows that the holonomy group is contained in SU(m). | Remark: Strictly 
speaking, we have only shown that the restricted holonomy group is contained in 
SU(m). This statement remains true even when M is multiply connected.] □ 

8.6 Harmonic forms and 3 -cohomology groups 
The (r, v)th 3-cohomology group is defined by 

= Z r - S (M)/B r - S (M). 


( 8 . 102 ) 



An element \o>] e Hj S (M) is an equivalence class of 3-closed forms of bidegree 
(r, s ) which differ from m by a 3 -exact form, 

[co] = {r) e SY' s (M) \dr) = 0, (o - r] = Hf, f e £2'' S_1 (M)}. (8.103) 

Clearly HP S (M) is a complex vector space. Similarly to the de Rham 

cohomology groups, the 3-cohomology groups of C" are trivial, that is, all the 
closed (r, s) -forms are exact. The 3 -cohomology groups measure the topological 
non-triviality of a complex manifold M. 

8.6.1 The adjoint operators 9 ' and 9 

Let M be a Hermitian manifold with dime M — m. Define the inner product 
between a, /I e £2 r,s (M) (0 < r, s < m) by 

(a,P) = I aA*/3 (8.104) 

Jm 

where * : Q , S (M ) -> Q m ~ r ’ m ~ s (M) is the Hodge * defined by 


*/3 = */3 = */3 (8.105) 

where *p is computed according to (7.173) extended to £2 r+s (M)^ . [ Remark : * 
maps an (r, ,s)-form to an (m — s, m — r)-form since it acts on a basis of Q r - S (M), 
up to an irrelevant factor, as 

* dz w A ... A d A dz Vl A ... A dz Vs ~ e* 1 '"^TZ r+l ..jz m e Vl '" Vs v s+ i...v m 
X d7' /ir+1 A ... A d lf lm A dz Us+1 A ... A d z Vm ■ 

Note that the above e-symbols are the only non-vanishing components in a 
Hermitian manifold. Now it follows that * : Q r ’ s (M) -> Q m ~ r ’ m ~ s (M).] 

We define the adjoint operators 3 1 and 3 of 3 and 3 by 

(a, 3/3) = (3V /3) (a, 3/3) = (fa, p). (8.106) 

The operators 3 f and 3 change the bidegrees as 3^ : Q r - S (M) — > £2 r-1 ’ s (M) 
and 3^ : Q, r - S (M) — >■ Q, r ' s ~ l (M). Clearly d : = 3^ + 3*. Noting that a 
complex manifold M is even dimensional as a differentiable manifold, we have 
(see (7.184a)) 

d f = -*d*. (8.107) 

Proposition 8.3. 

3 t = — * 3*, 3 f = - * 3 * . (8.108) 



Proof. Let co e fT' l,s (M) and i// e Q r ' s (M). If we note that co A * 1 // e 
S2'"-l’ m (M) and hence 3(a) A*\f) — 0, we find that 

d (ft) A *l/r) = 3(« A *l/r) = 3ft) A *l/r + (— l)' + ' s_1 ft) A 

= dco/\*f + (-l) r+ * _1 G> A (-iy + * +1 **3(*i/r) 

= 3ft) A *l/r + ft) A * *3*l/f (8.109) 

where use has been made of the facts 3*i/r e **/) — * * ft and 

(7.176a). If (8.109) is integrated over a compact complex manifold M with no 
boundary, we have 

0 = (3ft>, xf) + («, *3*i Jr). 

The second term is 

(ft), *3 *\[r) — (co, *3 * i/r) = (ft), *3 * i [r). 

We finally find 0 = (3ft), i f) + (co, *3 * f), namely 3 f = — * 3*. The other 
formula 3 = — * 3* follows similarly. □ 

As a corollary of proposition 8.3, we have 

(3 t ) 2 = (3^) 2 = 0. (8.110) 


8.6.2 Laplacians and the Hodge theorem 

Besides the usual Laplacian A = (dd * + d : d), we define other Laplacians Aa and 
Ag- on a Hermitian manifold, 

Aa = (3 + 3 t ) 2 = 33 f + 3 f 3 (8.111a) 

Ag = (9 + 3 t ) 2 = 33 + + 3 f 3 . (8.111b) 

An (r, ,v)-form co which satisfies A geo = 0 (Ag -co = 0) is said to be 3-harmonic 
(3-harmonic). If Aa co — 0 (Agft> = 0), co satisfies 3co = d^co = 0 (3 co = 3 ^ co = 
0). 

We have the complex version of the Hodge decomposition. Let Harm^'(M) 
be the set of 3-harmonic (r, ,v)-forms. 

HarmL s (M) = {co e Q r ' s (M)\A^co = 0}. (8.112) 

O 0 

Theorem 8.8. (Hodge’s theorem) Q r ' s (M) has a unique orthogonal decomposi- 
tion: 

£2™(M) = 3Q" _1 (M) © 3 t ft r ’ s+1 (M) © Harm & S (M) 

O 


(8.113a) 



namely an (r, i)-form co is uniquely expressed as 


co = da + 3 t /i + y (8. 1 13£>) 

where a e Q r - S -'(M), p e Q r ’ s+ '(M) and y e Harm j(M). 

The proof is found in lecture 22, Schwartz (1986), for example. If co is 3- 
closed, we have 3&> = 3 3^ = 0. Then 0 = {P,dd*P) = (3 t / 6,3 t / 6> > 0 
implies 3 ^ p = 0. Thus, any closed ( r , .S’) -form co is written as co = y + da, 
a e Q r - s ~ l (M). This shows that Hf’ s (M) C Harmh'(M). Note also that 

Harm Q. s (M) C Z- s (M) since dy — 0 for y e Harm^'(M). Moreover, 
Harmii' s (M) fl = 0 since Bj S (M) = 3 £2'' S_1 (M) is orthogonal to 

Harnv'(;V/). '1’hen it follows that Harmh v (/W) = H^ S {M). If P : Q. r ' s (M) -> 

3 3 3 

Harm^'(M) denotes the projection operator to a harmonic (r, 5 ) -form, [&>] e 
H'- s (M) has a unique harmonic representative / > o> e Harm^'(M). 

8.6.3 Laplacians on a Kahler manifold 

In a general Hermitian manifold, there exist no particular relationships among 
the Laplacians A, A ;) and A-. However, if M is a Kahler manifold, they are 
essentially the same. [Note that the Levi-Civita connection is compatible with the 
Hermitian connection in a Kahler manifold.] 

Theorem 8.9. Let M be a Kahler manifold. Then 

A = 2A 9 = 2Ag. (8.114) 

The proof requires some technicalities and we simply refer to Schwartz 
(1986) and Goldberg (1962). This theorem puts constraints on the cohomology 
groups of a Kahler manifold M. A form co which satisfies dco — d^ co — 0 
also satisfies dco — d^co — 0. Let co be a holomorphic p- form; dco = 0. 
Since co contains no dz^ in its expansion, we have 3 ' co = 0, hence Ag-&> = 

(33^ + 3 ^3)&) = 0. According to theorem 8.9, we then have A co — 0, that is any 
holomorphic form is automatically harmonic with respect to the Kahler metric. 
Conversely Aco = 0 implies dco — 0, hence every harmonic form of bidegree 
( p , 0) is holomorphic. 

8.6.4 The Hodge numbers of Kahler manifolds 

The complex dimension of H^ s (M) is called the Hodge number b r,s . The 
cohomology groups of a complex manifold are summarized by the Hodge 



diamond, 

/ 


j^m,m 


un,m — \ 


Lm — l.m 


0 1,1 


,l,m— 1 ^0,m 


V 


b Q,0 


(8.115) 


) 


These ( m + 1)” Hodge numbers are far from independent as we shall see later. 


Theorem 8.10. Let M be a Kahler manifold with dime M — m. Then the Hodge 
numbers satisfy 


(a) b r ' s =b s ’ r (8.116) 

(b) b r,s = b m-r,m-s (8 . 117) 

Proof, (a) If co e Q r ’ s (M ) is harmonic, it satisfies A^oo — Ago) — 0. Then the 
(s, r)-form co is also harmonic, Afa> = 0 since Afa> = A = Ajco — 0 (note 
that Aa = Aj). Thus, for any harmonic form of bidegree (r, .v), there exists a 
harmonic form of bidegree (s, r) and vice versa. Thus, it follows that // , v = b SJ ' . 
(b) Let w e Q r,s (M) and f e (M). Then u> A \lr is a volume element 

and it can be shown (Schwartz 1986) that f M co A \[r defines a non-singular 
map Hf s {M) x H^~ r ’ m ~ s {M) -> C, hence the duality between Hf s {M) and 
(M). This shows that Hf s (M) is isomorphic to (M) as a 

vector space and it follows that dime //L 5 (M) — dime Hj~ r,m ~ s (M) hence 
b r,s = fr m — r ’ m ~ s Q 

Accordingly, the Hodge diamond of a Kahler manifold is symmetric about 
the vertical and horizontal lines. These symmetries reduce the number of 
independent Hodge numbers to (^m + l) 2 if m is even and j(m + 1 )(m + 3) 
if m is odd. 

In a general Hermitian manifold, there are no direct relations between the 
Betti numbers and the Hodge numbers. If M is a Kahler manifold, however, 
theorem 8.1 1 establishes close relationships between them. 

Theorem 8.11. Let M be a Kahler manifold with dim-; M — m and DM — 0. 
Then the Betti numbers b p (1 < p < 2m) satisfy the following conditions; 

(a) b p = ^ (8.118) 

r+s=p 

b 2p -' is even (1 < p < m ) 

b 2p >1 (1 < p < m) 


(b) 

(c) 


(8.119) 

( 8 . 120 ) 



Proof, (a) Hf s (M) is a complex vector space spanned by Ag--harmonic (r, s)- 
forms, H^ S (M) — {[w]\w e £2 r ’ s (M), A jw = 0}. Note also that, H P (M) 
is a real vector space spanned by A-harmonic p-forms, H P (M) = {[ft>]|&> e 
£2 / ’(M), Aw = 0}. Then the complexification of H P {M) is H P (M ) c = {[tw]|o> e 
£2^(M) C , Aw — 0}. Since M is Kahler, any form w which satisfies Ajw = Oalso 
satsihes Aw = 0 and vice versa. Since 


Q p (M) c = ® r+s=p Q r - s (M) 


we find that 

H p (M) c = ® r +s= P H rs (M). 

Noting that dimjj H P (M) — dime H P (M ) lL , we obtain b p = fZ r+s=p b r ' s . 
(b) From (a) and (8.1 16), it follows that 


b 2p ~ 1 


b r ’ s — 2 

r+s=2p—\ r+s=2p— 1 

r>s 


Thus, b 2p ~ l must be even. 

(c) The crucial observation is that the Kahler form £2 is a closed real two- 
form, d£2 = 0, and the real 2 p-form 

£2 P = £2 A ... A £2 
p 

is also closed, d£2 p = 0. We show that £2 ,;l is not exact. Suppose Q p — dp for 
some i] e £2 2p ~ l (M). Then £2 m = £2 m ~ p A £2 P = d (£2 m-p A p). It follows from 
Stokes’ theorem that 

[ £2'" = [ d(£2 m ~ p A p) = [ Q m - p A p = 0. 

JM Jm JdM 

Since the LHS is the volume of M, this is in contradiction. Thus, there is at least 
one non-trivial element of H 2p {M) and we have proved that b 2p >1. □ 

If a Kahler manifold is Ricci flat, there exists an extra relationship among 
the Hodge numbers, which further reduces the independent Hodge numbers, see 
Horowitz (1986) and Candelas (1988). 

8.7 Almost complex manifolds 

This and the next sections deal with spaces which are closely related to complex 
manifolds. These are somewhat specialized topics and may be omitted on a first 
reading. 



8.7.1 Definitions 


There are some differentiable manifolds which carry a similar structure to 
complex manifolds. To study these manifolds, we somewhat relax the condition 
(8.16) and require a weaker condition here. 

Definition 8.5. Let M be a differentiable manifold. The pair (M, J ), or simply 
M, is called an almost complex manifold if there exists a tensor field J of type 
(1,1) such that at each point p of M, Jj, = —id t p m- The tensor field J is also 
called the almost complex structure. 

Since J~ = —id t p m, Jp has eigenvalues ±i. If there are m + i, then there 
must be an equal number of — i, hence J p is a 2m x 2m matrix and ./^ = —hm- 
Thus, M is an even-dimensional manifold. Note that not all even-dimensional 
manifolds are almost complex manifolds. For example, S 4 is not an almost 
complex manifold (Steenrod 1951). Note also that we now require a weaker 
condition — —hm- Of course, the tensor J p defined by (8.16) satisfies 
Jp — — hm , hence a complex manifold is an almost complex manifold. There 
are almost complex manifolds which are not complex manifolds. For example, it 
is known that S 6 admits an almost complex structure, although it is not a complex 
manifold (Frohlicher 1955). 

Let us complexify a tangent space of an almost complex manifold (M, J). 
Given a linear transformation J p at T p M such that jj, — — hm , we extend J p to a 
C-linear map defined on T p M c . J p defined on T p M c also satisfies J~ = —hm, 

J p (X + i Y) = jjx + iJpY = -X + i(— F) = -(Z + iF) 

where X, Y e T p M. Let us divide T p M into two disjoint vector subspaces, 
according to the eigenvalue of J p , 

T p M c = T p M+ ®T p M~ (8.121) 

where 

T p M ± = {Z e T p M c \J p Z = ±iZ}. (8.122) 

Any vector V e T p M c is written as V = Wi + W 2 , where VFi, Wi e T p M + . 
Note that J p V — i IF 1 — i W 2 • At this stage the reader might have noticed that we 
can follow the classification scheme of vectors and vector fields developed for the 
complex manifolds in section 8.2. In fact, the only difference is that on a complex 
manifold the almost complex structure is explicitly given by (8.18), while on 
an almost complex manifold, it is required to satisfy the less strict condition 
jj, — —hm - To classify the complexified tangent spaces and complexified vector 
spaces, we only need the latter condition. Accordingly, we separate T p M into 
T p M ± and X(M) C into X(M) ± , although there does not necessarily exist a basis 



of T p M + of the form {3/3 z M }. For example, we may still define the projection 
operators 

P ± = 4(id TpM T U p ) : T p M c -> T p M ± . (8.123) 

We call a vector in T p M + ( T p M~ ) a holomorphic (anti-holomorphic) vector and 
a vector field in X(M) + (X(M)~) a holomorphic (anti-holomorphic) vector field. 

Definition 8.6. Let ( M , /) be an almost complex manifold. If the Lie bracket of 
any holomorphic vector fields X, Y e X + (M) is again a holomorphic vector field, 
[X . Y] e X + (M), the almost complex structure J is said to be integrable. 

Let (M, /) be an almost complex manifold. Define the Nijenhuis tensor 
field N : X(M) x X(M) -* X(M) by 

N(X, Y) = [X, Y] + J[JX, Y] + J[X , JY ] - [ JX , JY ]. (8.124) 

Given a basis { e ^ = d/dx 11 } and the dual basis {dx^}, the almost complex 
structure is expressed as J — Jp dx^ ® d/dx v . The component expression 
of N is 


N(X, Y) = (X v d V Y^ - Y v d v X^)ep 

+ J x *{J K v X K d v Y x - Y v d v (JK l X K )}ep 
+ J^{X v d v (J, x Y K ) - J K v Y K d v X k }e fl 

- {J K v X K d v (h ,l Y k ) - J K v Y K d v (h ,1 X k )\e ll 
= X K Y v [-h»{d v J K k ) + J) ll (d K J v k ) 

- J K k {dxJ x fi) + J v k (dxJ^)]e„. (8.125) 

Thus, N is indeed linear in X and Y and hence a tensor. If J is a complex 
structure, J is given by (8.18) and the Nijenhuis tensor field trivially vanishes. 

Theorem 8.12. An almost complex structure J on a manifold M is integrable if 
and only if N(A, B) = 0 for any A. B e X(M). 


Proof. Let Z — X + i Y, W — U + iV e X{M) C . We extend the Nijenhuis tensor 
field so that its action on vector fields in X(M) C is given by 

N(Z, W ) = [Z, W] + J[JZ , W] + J[Z, JW ] - [JZ. JW ] 

= {N(X, U) - N(Y, V)} + i {N(X, V) + N(Y, U)}. (8.126) 

Suppose that N(A, B) — 0 for any A, B e X(M). From (8.126), it turns 
out that N(Z, W) = 0 for Z.We X C (M). Let Z, W e X+(M) c X(M) C . 
Since JZ = iZ and JW = i W, we have N(Z, W) = 2{[Z, W] + U[Z, W]}. By 
assumption, N(Z, W ) = 0 and we find [Z, W] — — i J[Z, W ] or J[Z, W] — 



i[Z, W], that is, [Z, W] e X + (M). Thus, the almost complex structure is 
integrable. 

Conversely, suppose that J is integrable. Since ‘X ' ( M ) is a direct sum of 
X + (M) and X _ (M), we can separate Z, W e X C (M) as Z = Z + + Z~ and 
W — W + + W~ . Then 

N(Z, W) = N(Z + , W + ) + N(Z + , W~) + N(Z~, W + ) + N(Z ~ , W~). 

Since JZ ± — ±i Z ± and JW ± = ±i W ± , it is easy to see that N{Z + , W~) — 
N(Z~ , W + ) — 0. We also have 

N(Z + , W + ) = [Z+, W + ] + 7[iZ+, W + ] + J[Z + , i W + ] - [iZ+, i W + ] 

= 2 [Z+, W + ] - 2[Z+, W + ] = 0 

since J[Z + , W + ] — i[Z + , W + ], Similarly, N(Z~ , VT _ ) vanishes and we have 
shown that N{Z, W ) = 0 for any Z, W e X C (M). In particular, it should vanish 
forZ, WeX(M). □ 

If M is a complex manifold, the complex structure / is a constant tensor 
held and the Nijenhuis tensor held vanishes. What about the converse? We now 
state an important (and difficult to prove) theorem. 

Theorem 8.13. (Newlander and Nirenberg 1957) Let (M, J ) be a 2m -dimensional 
almost complex manifold. If J is integrable, the manifold M is a complex 
manifold with the almost complex structure J . 

In summary we have: 

Integrable almost Vanishing Nijenhuis _ , . , 

= _ ,, = Complex manifold, 

complex structure tensor held 

8.8 Orbifolds 

Let M be a manifold and let G be a discrete group which acts on M. Then the 
quotient space T = Mj G is called an orbifold. As we will see later there are 
hxed points in M, which do not transform under the action of G. These points 
are singular and the orbifold is not a manifold in general. Thus, even though we 
start with a simple manifold M, the orbifold M/ G may have quite a complicated 
topology. 

8.8.1 One-dimensional examples 

To obtain a concrete idea, let us consider a simple example. Take M — ffi 2 which 
is to be identihed with the complex plane C. Let us take G — Z 3 and identify 
the points z, e 27n /V anc | e 47rl / 3 ^ xhe orbifold M/G consists of a third of the 




Figure 8.6. The orbifold C/Z3 is a third of the complex plane. The edges of the orbifold 
are identified as shown in the figure. V becomes a vector V after parallel transportation 
along C. The angle between V and V is 2n /3. 


complex plane and after the identification of the edges we end up with a cone, 
see figure 8 . 6 . It is interesting to see what the holonomy group of this orbifold 
is. We use the flat connection induced by the Euclidean metric of C. Then, after 
the parallel transport of a vector V along the loop C (this is indeed a loop!), we 
obtain a vector V which is different from V after the identification. Observe that 
the angle between V and V is 2 tt/ 3. It is easy to verify that the holomony group 
is Z 3 . Since the holonomy is trivial for the loop Co which does not encircle the 
origin, we find that the curvature is singular at the origin (recall that the curvature 
measures the non-triviality of the holonomy, see section 7.3). In general the fixed 
points (the origin in the present case) are singular points of the curvature. Note, 
however, that C/Z 3 is a manifold since it has an open covering homeomorphic to 
II 2 . 


A less trivial example is obtained by taking the torus as the manifold. We 
identify the points z and z + m + ne 17r ' 3 (m, n e Z) in the complex plane; see 
figure 8.7(a). If we identify the edges of the parallelogram OPQR, we have the 
torus T 2 . Let Z 3 act on T 2 as a : z !-»• e 2jrl ' 3 z. We find that there are three 
inequivalent fixed points z = (n/«J 3)e 7r1 / 6 where n — 0. I and 2. This orbifold 
T = C/Z 3 consists of two triangles surrounding a hollow; see figure 8.7(h). If 
the flat connection induced by the flat metric of the torus is employed to define the 
parallel transport of vectors, we find that the holonomy around each fixed point is 
Z3. 




Figure 8.7. Under the action of Z 3 , points of the torus T 2 are identified. The shaded area 
is the orbifold T = T~ /'Lt,. If the edges of the orbifold are identified, we end up with the 
object in figure 8.1(b), which is homeomorphic to the sphere S 2 . 



Figure 8.8. The conical singularity. The origin does not look like R" or C” . 


8.8.2 Three-dimensional examples 

Orbifolds with three complex dimensions have been proposed as candidates for 
superstring compactification. The detailed treatment of this subject is outside the 
scope of this book and the reader should consult Dixson et al (1985, 1986) and 
Green et al (1987). 

Let T — C 3 /L be a three-dimensional complex torus, where L is a lattice 
in C 3 . For definiteness, let (zi, Z2, Z 3 ) be the coordinates of C 3 and identify Zi 
and Zi + hi + ne 771 / 3 . Under this identification, T is identified with a product of 
three tori, T — T\ x T 2 x Tj. T admits, as before, the action of Z 3 defined 



by b : jj h e 27ri / 3 z,'. If each Zi takes one of the values 0, (1/V3)e 17r / 6 , 
(2/\/3)e 7ri / 6 , the action of a leaves the point ( Zi ) invariant. Thus, there are 
3 3 = 27 fixed points in the orbifold. In the present case, the fixed point is a 
conical singularity (figure 8.8) and the orbifold cannot be a manifold. [Remarks: 
The appearance of the conical singularity can be understood more easily from a 
simpler example. Let (x, y) e C 2 and let Z2 act on C 2 as (x, y ) m* ±(x, y). 
Then the orbifold F = C 2 /Z 2 has a conical singularity at the origin. In fact, let 
[(x, y)] — >■ (x 2 , xy, y 2 ) = ( X , Y, Z) be an embedding of F in C 3 . Note that X, Y 
and Z satisfy a relation Y 2 — XZ. If X, Y and Z are thought of as real variables, 
this is simply the equation of a cone.] 
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FIBRE BUNDLES 


A manifold is a topological space which looks locally like TR m , but not necessarily 
so globally. By introducing a chart, we give a local Euclidean structure to a 
manifold, which enables us to use the conventional calculus of several variables. 
A fibre bundle is, so to speak, a topological space which looks locally like a direct 
product of two topological spaces. Many theories in physics, such as general 
relativity and gauge theories, are described naturally in terms of fibre bundles. 

Relevant references are Choquet-Bruhat etal (1982), Eguchi etal (1980) and 
Nash and Sen (1983). A complete analysis is found in Kobayashi and Nomizu 
(1963, 1969) and Steenrod (1951). 

9.1 Tangent bundles 

For clarification, we begin our exposition with a motivating example. A tangent 
bundle T M over an m -dimensional manifold M is a collection of all the tangent 
spaces of M: 



(9.1) 


psM 


The manifold M over which T M is defined is called the base space. Let { [/; } be 
an open covering of M. If x il — <pi ( p ) is the coordinate on 17/ , an element of 


TUi = U T p M 


psUj 


is specified by a point p e M and a vector V = V fl (p)( d/dx^)\ p e T p M. 
Noting that Ui is homeomorphic to an open subset cp(Ui) of TR m and each 
T p M is homeomorphic to K'", we find that T U, is identified with a direct 
product E' ,! x ffi m (figure 9.1). If {p, V ) £ TUi, the identification is given by 
(p, V ) i->- ( x^ip ), V^ip)). TUi is a 2m -dimensional differentiable manifold. 
What is more, T Ui is decomposed into a direct product t/,- x ffi m . If we pick up 
a point u of TUi, we can systematically decompose the information u contains 
into a point p e M and a vector V e T p M. Thus, we are naturally led to the 
concept of projection n : TUi Ui (figure 9.1). For any point u e TUi, 7t(u) 
is a point p e Ui at which the vector is defined. The information about the vector 


T p M = R m 



P 


U, s R m 


Figure 9.1. A local piece TUj ~ R m x R m of a tangent bundle TM. The projection tt 
projects a vector V e TpM to p. 


is completely lost under the projection. Observe that = T p M. In the 

context of the theory of fibre bundles, T p M is called the fibre at p. 

It is obvious by construction that if M = M'” , the tangent bundle itself is 
expressed as a direct product x W" . However, this is not always the case 
and the non-trivial structure of the tangent bundle measures the topological non- 
triviality of M. To see this, we have to look not only at a single chart Uj but also 
at other charts. Let Uj be a chart such that Ui fi U j ^ 0 and let y^ = if ip) be 
the coordinates on Uj. Take a vector V e T p M where p e Ui fi Uj. V has two 
coordinate presentations. 


n u 

v = y ^ — 

dx v 

It is easy to see that they are related as 


~ ii V 
— 

3y^ 


dy v 

yv = irn (p)Vfl - 

dx 11 


(9.2) 


(9.3) 


For {a: 77 } and [y v ] to be good coordinate systems, the matrix (GJj) = {dy v /dx^) 
must be non-singular: (GJj) e GL(m, M). Thus, fibre coordinates are rotated 
by an element of GL(/n , M) whenever we change the coordinates. The group 
GL(m,K) is called the structure group of TM. In this way fibres are 
interwoven together to form a tangent bundle, which consequently may have quite 
a complicated topological structure. 

We note en passant that the projection jt can be defined globally on M. It 
is obvious that niii) — p does not depend on a special coordinate chosen. Thus, 
it : T M — > M is defined globally with no reference to local charts. 



Let X e X(M) be a vector field on M. X assigns a vector X\ p e T p M 
at each point p e M. From our viewpoint, X is looked upon as a smooth map 
M — »■ T M . This map is not utterly arbitrary since a point p must be mapped to 
a point u e TM such that it(u) = p. We define a section (or a cross section) 
of TM as a smooth map s : M — »■ TM such that it o s — id m- If a section 
Si : U j — > T Ui is defined only on a chart U , . it is called a local section. 

9.2 Fibre bundles 

The tangent bundle in the previous section is an example of a more general 
framework called a fibre bundle. Definitions are now in order. 

9.2.1 Definitions 

Definition 9.1. A (differentiable) fibre bundle M, F, G) consists of the 

following elements: 

(i) A differentiable manifold E called the total space. 

(ii) A differentiable manifold M called the base space. 

(iii) A differentiable manifold F called the fibre (or typical fibre). 

(iv) A surjection jt : E — »■ M called the projection. The inverse image 
it~ l ( p) = F p = F is called the fibre at p. 

(v) A Lie group G called the structure group, which acts on F on the left. 

(vi) A set of open covering {(/, } of M with a diffeomorphism </>,- : U, x F -» 
jt~ l (Uj) such that it o r/j, ( p, f) = p. The map </>,- is called the local 
trivialization since (p~ l maps tL 1 (Ufi onto the direct product Ui x F. 

(vii) If we write fii(p, f) = fii.p(f), the map </>,-, p : F — > F p is a 
diffeomorphism. On Ui fl Uj 0, we require that tij(p) = tpfip ° <Pj,p ■ 
F — > F be an element of G. Then tpj and (/> j are related by a smooth map 
tjj : Ui fl Uj -» G as (figure 9.2) 

fijip, f ) = fiiip, tij(p)f). (9.4) 

The maps tjj are called the transition functions. 

[Remarks: We often use a shorthand notation E — % M or simply E to denote a 
fibre bundle (£, it, M, F, G). 

Strictly speaking, the definition of a fibre bundle should be independent of 
the special covering {[/, } of M. In the mathematical literature, this definition 
is employed to define a coordinate bundle ( E , it, M, F, G, {Ui}, {(/>/}). Two 
coordinate bundles ( E , it, M, F, G, {Uj}, {0/}) and (E, it, M, F, G, { V/ } , {V'v}) 
are said to be equivalent if ( E , it, M, F, G, {Ui} U {Vj}, {<pi} U {fij}) is again a 
coordinate bundle. A fibre bundle is defined as an equivalence class of coordinate 
bundles. In practical applications in physics, however, we always employ a certain 


F F 



Ui n Uj 


Figure 9.2. On the overlap Uj fl Uj , two elements fj , fj e F are assigned to u e tt 1 (p) , 
p e Ui O Uj . They are related by tjj ( p ) as fj = tjj ( p) fj . 

definite covering and make no distinction between a coordinate bundle and a fibre 
bundle.] 

We need to clarify several points. Let us take a chart Uj of the base space M. 
7t~ l (Uj) is a direct product diffeomorphic to Uj x F, c/>~ i : jt~ l (Uj ) ->{/,• x F 
being the diffeomorphism. If Uj fl Uj ^ 0, we have two maps 4>j and <pj on 
Uj fl Uj. Let us take a point u such that it(u) = p e Uj fl Uj. We then assign 
two elements of F, one by t/f -1 and the other by (pj 1 , 

4>T\u) = (p, fj), 4>j\u) = (p, fj) (9.5) 

see figure 9.2. There exists a map tjj : Uj fl U j G which relates fj and fj as 

fj — tjj (p)fj. This is also written as (9.4). 

We require that the transition functions satisfy the following consistency 
conditions: 

tjj(p) = identity map (p e Uj) (9.6a) 

tij(p) = tjiipr' (p e Uj n Uj) (9.6b) 

tij(p) ■ tjk(p) = t lk (p) (p e Uj nUj n Uk). (9.6c) 

Unless these conditions are satisfied, local pieces of a fibre bundle cannot be glued 
together consistently. If all the transition functions can be taken to be identity 
maps, the fibre bundle is called a trivial bundle. A trivial bundle is a direct 
product M x F. 



Given a fibre bundle E — > M, the possible set of transition functions is 
obviously far from unique. Let {{/, } be a covering of M and {</>, } and {(/>, } be two 
sets of local trivializations giving rise to the same fibre bundle. The transition 
functions of respective local trivializations are 

tijip) = °<Pj,p (9.7a) 

t,j(p) = °4>j,p- (9.7b) 

Define a map gj(p) : F -+ F at each point p e M by 

8i(p) = ° 4>i,p- (9.8) 

We require that gi(p) be a homeomorphism which belongs to G. This 
requirement must certainly be fulfilled if {(/>, } and {</), } describe the same fibre 
bundle. It is easily seen from (9.7) and (9.8) that 

tij(p) = gi(p)~ l ° tij(p) o gj(p). (9.9) 

In the practical situations which we shall encounter later, tjj are the gauge 
transformations required for pasting local charts together, while gj corresponds 
to the gauge degrees of freedom within a chart U ,• . If the bundle is trivial, we may 
put all the transition functions to be identity maps. Then the most general form of 
the transition functions is 


tij(p) = gi(p ) l gj(p)- (9-10) 

Let E — M be a fibre bundle. A section (or a cross section) s : M —> E 
is a smooth map which satisfies nos = id m- Clearly, s(p) = s \ p is an element of 
F p = n~ l (p). The set of sections on M is denoted by T(M, F). If U C M, we 
may talk of a local section which is defined only on U . F ((/, F) denotes the set of 
local sections on U. For example, F(M, T M) is identified with the set of vector 
fields X(M). It should be noted that not all fibre bundles admit global sections. 

Example 9.1. Let £ be a fibre bundle E — .S’ 1 with a typical fibre F — [— 1 , 1]. 
Let U\ — (0, 2n) and Ui = (—n,n) be an open covering of .S 1 and let 
A — (0, 7r) and B = (n, 2n) be the intersection U \ fl Ui. see figure 9.3. The 
local trivializations <p\ and <f >2 are given by 

(p^ l (u) = cp^ 1 (u) = (0 , t) 

for 6 e A and t e F. The transition function /‘ 12 (C)- & e A, is the identity map 
1 12 ( 0 ) : ( h- (. We have two choices on B\ 

(i) 4>-'(u) = {e,t),(p-'(u) = (ej) 

(ID ( p-\u) = {e,t), ( t>-'(u) = (e,-t) 


B 


Figure 9.3. The base space S 1 and two charts U\ and U 2 over which the fibre bundle is 
trivial. 


(a) 


(b) 



Figure 9.4. Two fibre bundles over .S’ 1 : (a) is the cylinder which is a trivial bundle S 1 x /; 
(b) is the Mobius strip. 


For case (I), we find that tn(0) is the identity map and two pieces of the local 
bundles are glued together to form a cylinder (figure 9.4(a)). For case (II), we 
have fi2(0) : 1 i-> —t, 6 e B, and obtain the Mobius strip (figure 9.4(b)). Thus, a 
cylinder has the trivial structure group G = {e} where e is the identity map of F 
onto F while the Mobius strip has G — {e, g} where g : t i-> —t. Since g 2 — e, 
we find G = Z 2 . A cylinder is a trivial bundle S' 1 x F, while the Mobius strip is 
not. [ Remark : The group Z 2 is not a Lie group. This is the only occasion we use 
a discrete group for the structure group.] 

9.2.2 Reconstruction of fibre bundles 

What is the minimal information required to construct a fibre bundle? We now 
show that for given M, {{/, }, f (/ (p), F and G, we can reconstruct the fibre bundle 
(E, tv, M, F, G). This amounts to finding a unique tv, E and cpi from given data. 
Let us define 

X = |J UixF. (9.11) 

i 

Introduce an equivalence relation ~ between (p, f) e Uj x F and (q, f) e 
Uj x F by (p, f) ~ ( q , f) if and only if p — q and f' = tij(p)f. A fibre 




bundle E is then defined as 

E = Xj~. (9.12) 

Denote an element of E by [ ( /; , /)]. The projection is given by 

n :[(p,f)]^ p. (9.13) 

The local trivialization 0/ : Ui x F — > is given by 

<t>i : (P, /) ^ [(P, /)]• (9.14) 

The reader should verify that E, n and {</>,•} thus defined satisfy all the axioms of 
fibre bundles. Thus, the given data reconstruct a fibre bundle E uniquely. 

This procedure may be employed to construct a new fibre bundle from an old 
one. Let ( E. jr, M, F, G) be a fibre bundle. Associated with this bundle is a new 
bundle whose base space is M, transition function tjj(p), structure group G and 
fibre F' on which G acts. Examples of associated bundles will be given later. 


9.2.3 Bundle maps 


Let E — M and E' M' be fibre bundles. A smooth map / : E' — > E 
is called a bundle map if it maps each fibre F' p of E' onto F q of E. Then / 
naturally induces a smooth map / : M' — > M such that f(p ) = q (figure 9.5). 
Observe that the diagram 


E' — f -> E 1 

( u 

-U f(u) 

\ 

*1 

r 




M' - 

U M 

\ p 

/ 

— ► q 

/ 


(9.15) 


commutes. [ Caution : A smooth map / : E' — »■ E is not necessarily a bundle 
map. It may map u, v e F' p of E' to f(u) and f(v) on different fibres of E so 
that jr(/ (u)) / 7 r(f(v)).\ 




Figure 9.6. Given a fibre bundle E M, a map / : N — *■ M defines a pullback bundle 
f*E over N. 


9.2.4 Equivalent bundles 

Two bundles E' — M and E M are equivalent if there exists a bundle map 
/ : E' — > E such that / : M —> M is the identity map and / is a diffeomorphism: 



(9.16) 


This definition of equivalent bundles is in harmony with that given in the remarks 
following definition 9.1. 


9.2.5 Pullback bundles 

Let E -—>■ M be a fibre bundle with typical fibre F. If a map / : N -> M is 
given, the pair ( E , f ) defines a new fibre bundle over N with the same fibre F 
(figure 9.6). Let f*E be a subspace of N x E, which consists of points ( p, u ) 
such that f(p) — 7 r(«). f*E = {(/?, u) e N x E\f(p) = 7t(u)} is called the 
pullback of E by /. The fibre F p of f*E is just a copy of the fibre Ff( P ) of E. If 
we define f*E — > N by tt\ : ( p , u) i-> p and f*E E by (p, u) u, the 
pullback f*E may be endowed with the structure of a fibre bundle and we obtain 
the following bundle map, 





Figure 9.7. The transition function /*. of the pullback bundle f*E is a pullback of the 
transition function tjj of E. 


The commutativity of the diagram follows since Jt(jt 2 (p, u)) — tx{u) — f(p ) = 
f(rc\{p, u)) for (p, u ) e f*E. In particular, if N — M and / = id m. then two 
fibre bundles f*E and E are equivalent. 

Let {{/,} be a covering of M and {</>/} be local trivializations. {f~ l {Uj)} 
defines a covering of N such that f*E is locally trivial. Take u e E such 
that 7 t(m) = f(p) e Uj for some p e N. If (p^ ] (u) — ( f(p), ft ) we find 
i//- r ] (p, u ) = (p, fj ) where i/r,- is the local trivialization of f*E. The transition 
function tjj at f(p) e Uj fl Uj maps fj to fj = by ( f(p))fj • The corresponding 
transition function f*. of f*E at p e f~ l (Uj ) fl f~ l (U j) also maps fj to f: see 
figure 9.7. This shows that 


t* } (p) = t,j(f( P ))- 


(9.18) 


Example 9.2. Let M and N be differentiable manifolds with dim M — dim N — 
m . Let / : /V — > M be a smooth map. The map / induces a map it 2 : T N — > T M 
such that the following diagram commutes: 

TN -3* TM 

^i| y (9.19) 

N -U M. 

Let W = W v d/dy v be a vector of T p N and V = V^d/dx 11 be the corresponding 
vector of T f( P )M. If T N is a pullback bundle f*(T M ), jtn maps T p N to Tf( P )M 
diffeomorphically. This is possible if and only if 7t2 has the maximal rank m at 





each point of TN. Let (p(f(p)) = (f l (y), . .., f m (y)) be the coordinates of 
f(p ) in a chart (U, q>) of M, where y — (p(p) are the coordinates of p in a chart 
(V, V') of N. The maximal rank condition is given by det(3/ 7t (y)/3y y ) ^ 0 for 
any p e N. 

9 . 2.6 Homotopy axiom 

Let / and g be maps from M' to M. They are said to be homotopic if there 
exists a smooth map £ : M' x [0, 1] -» M such that F(p, 0) = /( p) and 
F(p, 1) = g(p) for any p e M' , see section 4.2. 

Theorem 9.1. Let E — M be a fibre bundle with fibre F and let / and g be 
homotopic maps from IV to M. Then f*E and g*E are equivalent bundles over 
N. 


The proof is found in Steenrod (1951). Let M be a manifold which is 
contractible to a point. Then there exists a homotopy F : M x I — »• M such 
that 

F(p. 0) = p F(p, 1) = po 

where po e M is a fixed point. Let E — % M be a fibre bundle over M and 
consider pullback bundles h^E and h*E, where h t (p ) = F(p,t). The fibre 
bundle h* E is a pullback of a fibre bundle { po} x F and hence is a trivial bundle: 
h*E ~ M x F . However, h^E = E since ho is the identity map. According to 
theorem 9.1, h^E = E is equivalent to h*E — M x F, hence £ is a trivial bundle. 
For example, the tangent bundle TW is trivial. We have obtained the following 
corollary. 

Corollary 9.1. Let E — % M be a fibre bundle. E is trivial if M is contractible to 
a point. 

9.3 Vector bundles 

9.3.1 Definitions and examples 

A vector bundle E — M is a fibre bundle whose fibre is a vector space. Let 
F be W and M be an m -dimensional manifold. It is common to call k the 
fibre dimension and denote it by dim E, although the total space E is m + k 
dimensional. The transition functions belong to GL(L, K), since it maps a vector 
space onto another vector space of the same dimension isomorphically. If £ is a 
complex vector space C k , the structure group is GL (k, C). 

Example 9.3. A tangent bundle TM over an m-dimensional manifold M is a 
vector bundle whose typical fibre is , see section 9.1. Let u be a point in 
TM such that i r(u) = p e Uj fl Uj, where {[/,•} covers M. Let = (Pi(p) 



()> ,< = (pj(p )) be the coordinate system of C// (£//). The vector V corresponding 
to u is expressed as V — V IJ 'd/dx^\p = V^d/dy^l p . The local trivializations are 

4>T\u) = ( P , {V^}) 07 ' (M) = Oh {V^}). (9.20) 

The fibre coordinates {V^ 1 } and { V 1 ' } are related as 

= G li v (p)V v (9.21) 


where {G^yf/H} = {(dx^/d y v ) p ] e GL(m, E). Hence, a tangent bundle 
is (TM, n, M, E m , GL(m, E)). Sections of TM are the vector fields on M; 
X(M) = F(M, T M). 

For concreteness let us work out T S 2 . Let the pair U\i = S 2 — {South Pole} 
and Us = S 2 — {North Pole} be an open covering of S 2 . Let (X. Y) and ( U . V ) 
be the respective stereographic coordinates (example 8.1). They are related as 

U = X/(X 2 + Y 2 ) V = - Y/(X 2 + Y 2 ). (9.22) 

Take u e TS 2 such that tt(u) — p e Us (T Us- Let cps and <ps be the respective 
local trivializations such that <f>^ l (u) = (p, V^) and = (p, V$). The 

transition function is 


3 (U,V) 1 / — cos 26 —sin 26 \ 

Nn(/N = Y) = 72 V sin 26 -cos2 6 ) 


(9.23) 


where we have put X — r cos 6 and Y = r sin 0. The transition of the components 
of the tangent vectors consists of a rotation of { V[ J } by an angle 26 followed by a 
rescaling. The reader should verify that fNs(p) = tsN(p) -1 - 

Example 9.4. Let M be an m -dimensional manifold embedded in W 1+k . Let 
N p M be the vector space which is normal to T,,M in W 1+k , that is, U ■ V = 0 
with respect to the Euclidean metric in W n+k for any U € N p M and V e T p M. 
The vector space N p M is isomorphic to E*. The normal bundle 


NM = [J N p M 

peM 


is a vector bundle with the typical fibre E^ . 

Consider the sphere S 2 embedded in E 3 . The normal bundle NS 2 is 
imagined as S 2 whose surface is pierced perpendicularly by straight lines. NS 2 is 
a trivial bundle S 2 x E. 

A vector bundle whose fibre is one-dimensional ( F = E or C) is called a 
line bundle. A cylinder 5 1 x E is a trivial E-line bundle. A Mobius strip is also a 
real line bundle. The structure group GL(1, E) = E— {0} or GL(1, O = C— {0} 
is Abelian. 



In the following, we often consider the canonical line bundle L. Recall that 
an element p of CP" is a complex line in C" +1 through the origin (example 8.3). 
The fibre n~ l (p) of L is defined to be the line in C !+1 which belongs to p. More 
formally, let P ,+1 = CP" x C !+1 be a trivial bundle over CP" . If we write an 
element of 7" +1 as ( p , v), p e CP" , v e C !+1 , L is defined by 

L = {(/?, v ) e I n+l \v — ap, a e C}. 

The projection is (/?, i>) p. 

Example 9.5. The (trivial) complex line bundle L = E 3 x C is associated with 
the non-relativistic quantum mechanics defined on E 3 . The wavefunction i//(x) is 
simply a section of L . 

Let us consider a wavefunction i j/(x) in the field of a magnetic monopole 
studied in section 1.9. When a monopole is at the origin, i fr(x) is defined on 
E 3 — {0} and we have a complex line bundle over E 3 — {0}. If we are interested 
only in the wavefunction on S 2 surrounding the monopole, we have a complex 
line bundle over S 2 . Note that S 2 is a deformation retract of E 3 — {0}. 

9.3.2 Frames 

On a tangent bundle PM, each fibre has a natural basis {d/dx 1 *} given by the 
coordinate system x 1 ' on a chart Uj. We may also employ the orthonormal basis 
{e a } if M is endowed with a metric, d/dx 1 * or {e a } is a vector field on Ui and the 
set {d/dx^} or {e a } forms linearly independent vector fields over Ui. It is always 
possible to choose m linearly independent tangent vectors over Ui but it is not 
necessarily the case throughout M. By definition, the components of the basis 
vectors are 

d/dx 1 * — (0, ..., 0, 1, 0, ..., 0) 

M 

or 

e a = (0, ..., 0, 1, 0, ..., 0). 

a 

These vectors define a (local) frame over U / , see later. 

Let E —>■ M be a vector bundle whose fibre is E* (or C*). On a chart 
Ui, the piece K~ l (Ui) is trivial, 7t~ l (Ui) = Uj x E*, and we may choose k 
linearly independent sections {e\(p ), . . . , eUp)} over Ui. These sections are said 
to define a frame over Ui . Given a frame over Ui , we have a natural map F p —> F 
(=E Ar or C k ) given by 

V = V a e a (p ) i — > {W 0 '} e F. (9.24) 


The local trivialization is 


<Pi\V)= (p,{V“(p)}). 


(9.25) 



By definition, we have 


<t>i (/MO, .... 0, 1, 0, 0}) =e a {p). (9.26) 

a 

Let Uj fi Uj ^ 0 and consider the change of frames. We have a frame 
[e\ (p), . . . , ek(p)} on £/; and {ei (p ), . . . , ek(p)} on Uj, where p e Uj fl Uj. A 
vector ep ( p) is expressed as 


ep(p) = e a (p)G(p) a p (9.27) 

where G(p) 01 p e GL(k, M) or GL (k, Q. Any vector V e n~ 1 (p) is expressed as 

V = v a e a (p) = v a e a (p). (9.28) 

From (9.27) and (9.28) we find that 

V? = G- l (p)P a V a (9.29) 

where G -1 (p)P a G(p) a y — G(p)P a G~ l (p) a y — S^y. Thus, we find that the 

transition function tjj (p) is given by a matrix G _1 ( p ). 

9.3.3 Cotangent bundles and dual bundles 

The cotangent bundle T*M = [J peM T*M is defined similarly to the tangent 
bundle. On a chart U: whose coordinates are x 1 ' , the basis of T*M is taken to be 

i p 

{da: 1 , .... dx"'}, which is dual to {3/3x M }. Let y M be the coordinates of Uj such 
that Ui C\Uj 0. For p e Uj fl Uj, we have the transformation, 

d ' v " = 4t "(£)/ (9 - 30> 

A one-form to is expressed, in both coordinate systems, as 

to = (On dx 11 — a>n ^ 


from which we find that 

a>n = Gu v (i j)to v (9.31) 

where G^ip ) = (3x y /3 y^) P corresponds to the transition function ? ; , ( p). Note 
that T(M, T*M ) = £2*(M). 

This cotangent bundle is easily extended to more general cases. Given a 
vector bundle E \ M with the fibre F, we may define its dual bundle E* 4 M. 
The fibre F* of E* is the set of linear maps of E to R (or C). Given a general basis 
[e a (p)} °f F p , we define the dual basis {0 a {p)} of F* by (6 a (p), ep(p)) = 8 a p. 



9.3.4 Sections of vector bundles 

Let s and s' be sections of a vector bundle E —> M. The vector addition and the 
scalar multiplication are pointwisely defined as 

(S + s')(p) = s(p) + s'(p) (9.32a) 

ifsKp) =f(p)s(p) (9.32b) 

where p e M and / e IF ( M ) . The null vector 0 of each fibre is left invariant 
under GLG, IK) (or GLG, Q) and plays a distinguished role. Any vector bundle 
E admits a global section called the null section sq e T(M, IF) such that 
4>~ l (so(p)) — (p, 0) in any local trivialization. 

For example, let us consider sections of the canonical line bundle L over 
CP" . Let Ci//) be the inhomogeneous coordinates and {z v } be the homogeneous 
coordinates on Up . The local section Sp over Up is of the form 

The transition from one coordinate system to the other is carried out by a scalar 
multiplication: s v = (z >l /zFjSp. Let L* be the dual bundle of L. Corresponding 
to .V// , we may choose a dual section s* such that .y* (>t//) = 1. From this, we find 
that the transition function of .y* is a multiplication by z v /z^, s* = (z v /z^)s* . 

A fibre metric h^ v (p) is also defined pointwisely. Let s and s' be sections 
over Uj . The inner product between s and s' at p is defined by 

G, s')p = h liV (p)s' l (p)s' v (p ) (9.33a) 

if the fibre is IK* . If the fibre is C* we define 

G, s’) p = h, JLV {p)s^{p)s' v {p). (9.33b) 

We have more about this subject in section 10.4. 


9.3.5 The product bundle and Whitney sum bundle 

Let E M and IF \ M' be vector bundles with fibres F and F' respectively. 

The product bundle 

E x E' 7TX7T '> MxM' (9.34) 

is a fibre bundle whose typical fibre is F © F' . [A vector in F © F' is written as 

where V e F and W e F' . 



Vector addition and scalar multiplication are defined by 



V + V' \ 

w+w ) 



and 



Let {e a } and {fp} be bases of F and F' respectively. Then {e a } U {fp} is a basis 
of F © F' and we find that dirnfF © F') — dim F + dim F' .] If tt(u) = p and 
it' (if) — p' the projection it x tt' acts on (u, if) e E x E’ as 

7 r x 7t'(u, u) = (p, p'). (9.35) 

The fibre at ( p , p') is F p © F' For example, if M = M\ x M 2 , we have 
TM = TM\ x TM 2 . 

JT 7T / 

Let E — > M and E' — »• M be vector bundles with fibres F and F' 
respectively. The Whitney sum bundle E © E' is a pullback bundle of E x E' 
by / : M — > M x M defined by f(p) = (p, p), 

E © E' E x E' 

(9.36) 

M -U M x M. 


Thus, E®E' — {{u,u') e E x E'\n X7T f (u, u') = (p, p)}. The fibre of a Whitney 
sum bundle is F © F' . (jc x jz')~ { (p) is isomorphic to Tt~ l (p) © n / ~ 1 (p) — 
Fp ® F^. In short, E © E' is a bundle over M whose fibre at p is F p © F' p . Let 
{(/,} be an open covering of M and {tf:} and {tf, } be the transition functions 

V V 

of E and E' respectively. Then the transition function 7’ (/ of E © E' is a 
(dim F + dim F') x (dim F + dim F' ) matrix 


T,j(p) = 


tfj(p) 0 \ 

0 tf- (p) ) 


(9.37) 


which acts on F © F' on the left. 


Example 9.6. Let E — T S 2 and E' = NS 2 defined in M 3 . Take u e T S 2 and 

r \ 1 1 

v e NS whose local trivializations are </> ( - (u) — (p, V ) and 1 jr, ( v ) = ( q , IT), 
respectively, where p, q e S 2 ,V e K? and IT e M. If (u, v ) is a point of the 
product bundle E x E\ we have a trivialization <J >; j = </>,- x \[rj such that 

v) — (p. q\ V, W). (9.38a) 

If, however, (u, v) e E © E' , u and v satisfy the stronger condition it(u) — Jt'(v) 
(—p, say). Thus, we have 

®7 1 (u,v) = (p;V,W). (9.38b) 

The Whitney sum T S 2 © NS 2 , S 2 being embedded in E 3 , is a trivial bundle over 
S 2 , whose fibre is isomorphic to M 3 . 



9.3.6 Tensor product bundles 

Let £ -—*■ M and E' —>■ M be vector bundles over M. The tensor product 
bundle E <g> E' is obtained by assigning the tensor product of fibres F p ® F' p to 
each point p e M. If [e a } and {fp} are bases of F and F' , F (g) F' is spanned by 
{e a <g> fp} and, hence, dim(£ <g> E ') = dim E x dim E' . 

Let (££)' E = E ® ® E be the tensor product bundle of r E. If {e a } is the 

basis of the fibre F of E, the fibre of (f)' E is spanned by \e a] ® ® e Ul } . If we 

define A by 

e a A ep = e a <E> ep - ep ® e a (9.39) 

we have a bundle A r (E) of totally anti- symmetric tensors spanned by {e ai A 
. . . A e ar }. In particular, Q r (M), the space of /--forms on M, is identified with 
T(M, A r (T*M)). 

Exercise 9.1. Let E\, E 2 and £3 be vector bundles over M. Show that <g> is 
distributive: 

Ei ® (£2 © £ 3 ) = (£1 <B> £ 2 ) © (£1 ® £ 3 )- (9.40) 

Express the transition functions of E\ <g> (£2 © £ 3 ) in terms of those of £ 1 , £2 
and £ 3 . 


9.4 Principal bundles 
9.4.1 Definitions 

A principal bundle has a fibre £ which is identical to the structure group G. A 
principal bundle P M is also denoted by P(M, G) and is often called a G 
bundle over M. 

The transition function acts on the fibre on the left as before. In addition, we 
may also define the action of G on £ on the right. Let </>,■ : (/, x G — > jr ~ 1 ( (/,■ ) 
be the local trivialization given by cf>~ l (u ) = ( p , g,-), where u e t r — 1 ( C// ) and 
p = Jt(u). The right action of G on jt~ l (Ui ) is defined by = ( p , g,a), 

that is (figure 9.8), 

ua = <pj(p, gta) (9.41) 

for any a e G and u e it~ l {p). Since the right action commutes with the 
left action, this definition is independent of the local trivializations. In fact, if 
p eUtd Uj, 


ua = <j>j(p, gja) = f jip , tji(p)gia) = <j>i(p, gta). 

Thus, the right multiplication is defined without reference to the local 
trivializations. This is denoted by P x G —*■ P or (i/,a) h> ua. Note that 
n(ua ) = 7T (u ) . The right action of G on tL 1 (p) is transitive since G acts on G 
transitively on the right and F p — tt~ ] (p) is diffeomorphic to G. Thus, for any 


U, 



P U, 


Figure 9.8. The right action of G on P . 


«i, M 2 e jr~ l (p) there exists an element a of G such that u i = 112 a. Then, if 
tv ( u) — p, we can construct the whole fibre as 7t~ l (p) = {ua\a e G}. The action 
is also free; if it a = u for some u e P, a must be the unit element e of G. In fact, 
if u = <pi(p , g{), we have 0, (p, gta) = <pi(p , gi)a — ua — u = (j),{p, g t ). Since 
tpi is bijeetive, we must have gja — gj, that is, a — e. 

Given a section .v 1 ( p) over C/,- , we define a preferred local trivialization 
1 pi : Uj x G — > as follows. For n e n~ 1 (p), p e Uj, there is a unique 

element g u e G such that u = Sj(p)g u . Then we define 0 / by </> t ' ( m ) = ( p , g u ). 
In this local trivialization, the section s ,• (p) is expressed as 

Si(p) = <Pi(p,e). ( 9 . 42 ) 

This local trivialization is called the canonical local trivialization. By definition 
< pi(p , g) — (pi(p, e)g — Si(p)g. If p e U/ fl Uj, two sections Sj(p) and5 ; (/ 2 ) are 
related by the transition function t t j ( p) as follows 

Si(p) = <Pi(p , e) - cpj(p, tji(p)e) = <pj(p, tjtip )) 

= e)tji(p) = Sj(p)tji(p). ( 9 . 43 ) 

Example 9.7. Let P be a principal bundle with fibre U(l) = .S' 1 and the base 
space S 2 . This principal bundle represents the topological setting of the magnetic 
monopole (section 1 . 9 ). Let { U\\ . U% \ be an open covering of S 2 , U\i (Us) being 
the northern (southern) hemisphere. If we parametrize S 2 by the usual polar 
angles, we have 

£/n = {(0, 0)|O < 0 < 7r/2 + s, 0 <<p < 2 jt } 

Us — {(0, 4>)\iz/2 — s < 6 < 7T, 0 < (f> < 2jtj. 





The intersection U\s fl Us is a strip which is essentially the equator. Let (j)\s and 
cf>s be the local trivializations such that 

= ( p , e” N ) 0s" 1 (m) = (p, e” s ) (9.44) 

where p = n(u). Take a transition function tss(p) of the form e 1 "^ where n 
must be an integer so that f\js( p) may be uniquely defined on the equator. Since 
?ns rnaps the equator S' 1 to U(l), this integer characterizes the homotopy group 
7ri(U(l)) = Z. The fibre coordinates and as are related on the equator as 

e i«N = e m0 e i« s (9.45) 

If n — 0, the transition function is the unit element of U(l) and we have a 
trivial bundle Pq — S 2 x S l . If n ^ 0, the U(l)-bundle P n is twisted. It is 
remarkable that the topological structure of a fibre bundle is characterized by an 
integer. The integer characterizes how two local sections are pasted together at 
the equator. Accordingly, the integer corresponds to the element of the homotopy 
group 7Ti(U(l)) = Z. 

Since U(l) is Abelian, the right action and the left action are equivalent. 
Under the right action g — e lA , we have 

0- 1 («g) = (/7,e i( “ N+A) ) (9.46a) 

</>“' 0 U g ) = (p, e i( “ s+A) ). (9.46b) 

The right action corresponds to the U(l)-gauge transformation. 

Example 9.8. If we identify all the infinite points of the Euclidean space , the 
one-point compactification S m = W n U {oo} is obtained. If a trivial G bundle is 
defined over TR m we shall have a new G bundle over S m after compactification, 
which is not necessarily trivial. Let P be an SU(2) bundle over S' 4 obtained from 
K 4 by one-point compactification. This principal bundle represents an SU(2) 
instanton (section 1.10). Introduce an open covering {C/n, Us) of S' 4 , 

Un = {( x , y, z, t)\x 2 + y 2 + z 2 + t 2 < R 2 + e} 

Us — {(x, y, z, t)\R 2 - e < x 2 + v 2 + z 2 + t 2 } 

where S’ is a positive constant and s is an infinitesimal positive number. The 
thin intersection Us H Us is essentially S 3 . Let tss(p) be the transition function 
defined at p € Us (T Us- Since tns maps S 3 to SU(2), it is classified by 
7r 3 (SU(2)) = Z. The integer characterizing the bundle is called the instanton 
number. If tss(p) is taken to be the unit element e e SU(2), we have a trivial 
bundle Pq = S 3 x SU(2), which corresponds to the homotopy class 0. Non-trivial 
bundles are obtained as follows. We first note that SU(2) = S 3 (example 4.12). 
An element A e SU(2) is written as 




where \u\ 2 + |u| 2 = 1. Separating u and v as u = t + i z and v = y + ix, we 
find t 2 + x 2 + y 2 + z 2 — 1. Thus SU(2) is regarded as the unit sphere S 3 and 
tt 3 (SU(2)) = JtiiS 3 ) = 7L classifies maps from S 3 to SU(2) = S 3 . The identity 
map f : S 3 S 3 = SU(2) is 


/ (x, y, z, t) i-> 


t + iz 
—y + ix 


y + ix 
t - i z 


— th + i(xcr.v + yoy + zo z ) 


(9.47) 


where I 2 is the 2x2 unit matrix and the a /; are the Pauli matrices. Let us take 
a point p — (x, y, z, t) e Us fl Us- If R — (x 2 + y 2 + z 2 + t 2 ) 3 ! 2 denotes the 
radial distance of p, the vector {x/R, y/R, z/R, t/R ) has unit length. We assign 
an element of SU(2) to the point p as 

tss(p) = 2 +i^x'cr/J. (9.48) 

^ i ' 

Let (ps and 0s be the local trivializations, 

^(m) = (P, gN) (l>s 1 («) = (P, gs) (9.49) 

where p — n(u) and gN, gs e SU(2). On Us O Us, we have 

gN = ^ r/ 2 + iX^x'a/^gs. (9.50) 


While (t,x) scans S 3 once, tss(p) sweeps SU(2) once, hence this bundle 
corresponds to the homotopy class 1 of 7T3(SU(2)). It is not difficult to see that 
the transition function corresponding to the homotopy class n is given by 

tss(p) = -^r(^ fl + i X! x,cr ') • (9 - 51) 


To continue our study of monopoles and instantons, we have to introduce 
connections (the gauge potentials) on the fibre bundle. We will come back to 
these topics in the next chapter. 

Example 9.9. Hopf has shown that S 3 is a U( I ) bundle over S 2 . The unit three- 
sphere embedded in M 4 is expressed as 

(x 1 ) 2 + (x 2 ) 2 + (x 3 ) 2 + (x 4 ) 2 = 1. 

If we introduce z° — x 1 + ix 2 and z 1 = x 3 + ix 4 , this becomes 

|z°| 2 + I z 1 1 2 = 1. 


(9.52) 



Figure 9.9. Stereographic coordinates of the sphere S 2 . (X, Y) is defined with respect to 
the projection from the North Pole while (U , V) with respect to the projection from the 
South Pole. 


Let us parametrize S 2 as 


(If 1 ) 2 + (f 2 ) 2 + (f 3 ) 2 = 1. 


The Hopf map jt : S 3 — >• S 2 is defined by 
I 1 = 2(x x x 3 + x 2 x 4 ) 


$ 2 = 2(x 2 x 3 — x 1 x 4 ) 

^ = ( X 1 ) 2 + ( X 2 )2- ( X 3 ) 2- ( X 4 ) 2. 


(9.53a) 

(9.53b) 

(9.53c) 


It is easily verified that n maps .S' 3 to .S 2 since 

or 1 ) 2 + (t 2 ) 2 + (t 3 ) 2 = [(x 1 ) 2 + (x 2 ) 2 + (x 3 ) 2 + (x 4 ) 2 ] 2 = i. 

Let (X, Y) be the stereographic projection coordinates of a point in the 
southern hemisphere Us of S 2 from the North Pole. If we take a complex plane 
which contains the equator of S 2 , Z — X + iL is within the circle of unit radius. 
We found in example 8.1 that (figure 9.9) 

z = = FTI? = ? e t,s) ' <9 54a) 

Observe that Z is invariant under 


($ e U S ). 


(9.54a) 


(z°, z 1 ) h* (A.z°, Lz 1 ) 

where X e U(l). Since |A.| = 1, the point (Xz°. Xz 1 ) is also in S’ 3 . The 
stereographic coordinates ( U , V ) of the northern hemisphere (7n projected from 
the South Pole are given by 

£'-i| 2 x 3 + ix 4 z 1 

TI T TT i :T7 ^ ~ 


w = U + iV = 


3 i ■ 4 
X J + IX 

x 1 + ix 2 


(| e (7 n). 


(9.54b) 



Note that Z = 1/ W on the equator Us fl (A,. 

The fibre bundle structure is given as follows. We first define the local 
trivializations, q : tt~ 1 (Us) — »■ Us x U(l) by 

(z 0 ^ 1 )^ (z°/z 1 ,z 1 /k 1 l) (9.55a) 

and 4 : 7T _1 (C/n) -> Us x U(l) by 

(z°, z 1 ) i-» (zVz°, z°/|z°|). (9.55b) 

Observe that these local trivializations are well defined on each chart. For 
example, ^ 0 on Us, hence both z 1 /z° = U + iV and z 0 / |z°| are non- 
singular. On the equator, | 3 = 0, we have |z°| = |z* | = 1 /*J2. Accordingly, the 
local trivializations on the equator are 

</>s 1 : (z°, z 1 ) (z°/zW2z l ) (9.56a) 

and 

0N 1 : (z°> z 1 ) (z'/z 0 , V2 z°). (9.56b) 

The transition function on the equator is 

V2z° 1 2 

fas(£) — —j=— = t 1 + if 2 e U(l). (9.57) 

V2Z 1 

If we circumnavigate the equator, ?ns(§) traverses the unit circle in the complex 
plane once, hence the U(l) bundle S 3 — S 2 is characterized by the homotopy 
class 1 of 7Ti (U(l )) = Z. Trautman (1977), Minami (1979) and Ryder (1980) 
have pointed out that a magnetic monopole of unit strength is described by the 
Hopf map S 3 — .S' 2 . 

The Hopf map can be understood from a slightly different point of view. We 
regard .S 3 as a complex one-sphere 

4 = {(z°,z 1 )eC 2 ||zT-Hz 1 | 2 = I}- 

Define a map n : 5j, — »■ CP 1 by 

(z°, z 1 ) [(Z°, z 1 )] = {A(Z°, Z 1 )^ e C — {0}}. (9.58) 

Under this map, points of S 3 of the form A(z , z 1 ), A. = 1 are mapped to a single 
point of CP 1 — .S' 2 . This is the Hopf map tv : .S' 3 — > .S' 2 obtained earlier. This 
is easily generalized to the case of the quaternion iHL The quaternion algebra is 
defined by the product table, 

i 2 = j 2 = k 2 = — 1 i j — —ji — k 

jk = —k j = i ki = —ik = j . 



An arbitrary element of H is written as 


q — t + ix + jy + kz. 

Clearly the unit quaternion \q\ — ( t 2 + x 2 + y 2 + z 2 ) 1 / 2 = 1 represents S 3 = 
SU(2). The quaternion one-sphere is given by 

4 = {(9°>9 1 ) e] ® 2 ll9 0 l 2 + l9 1 l 2 =l} (9-59) 

which represents S 7 . The Hopf map, in this case, takes the form 

n : 5^ — > HP 1 (9.60) 

where HP 1 is the quaternion projective space whose element is 

[(9°. 4 1 )] = fa(9°. q l ) el 2 |i)£l- {0}}. (9.61) 

Points of .S' 7 with q\ = 1 are mapped under this map to a single point of 
HP 1 = .S 4 and we have the Hopf map 

jt : S 1 -> S 4 . (9.62) 

The fibre is the unit quaternion S 3 = SU (2) . The transition function defined by 
the Hopf map belongs to the class 1 of 7T3(SU(2)) = Z. An instanton of unit 
strength is described in terms of this Hopf map. 

Octonions define a Hopf map n : S 1 7 -> S 8 . This differs from other Hopf 
maps in that the fibre S 1 is not really a group. So far we have not found an 
application of this map in physics. 1 

Example 9.10. Let H be a closed Lie subgroup of a Lie group G. We show 
that G is a principal bundle with fibre H and base space M — G/H . Define 
the right action of H on G by g i-^ ga, g e G, a e H . The right action is 
differentiable since G is a Lie group. Define the projection n : G -* M = G/H 
by the map it : g i->- [g] = {gh\h e H}. Clearly, g, ga e G are mapped to 
the same point [g] hence jt ( g) — Jt(ga) (=[g]). To define local trivializations, 
we need to define a map /, : G -> H on each chart Let s be a local 
section over [/,• and g e Jt~ l ([g]). Define f) by fi(g) = ^([g]) -1 ^. Since 
•?([,?]) is a section at [g], it is expressed as ga for some a e H and accordingly, 
,v([g])-' g — a~ l g~\g — a" 1 e H. Then we define the local trivialization 
</>,- : Ui x H -> G by 

^ 1 (8) = ([glfi(g)). (9.63) 

It is easy to see that fi(ga) — fi(g)a ( a e H) hence <p^ ] (ga) = (p, fi(g)a) is 
satisfied. Useful examples are (see example 5.18) 

0(n)/0(n - 1) = SO(ra)/SO(n - 1) = S"~ l (9.64) 

U(n)/U(« - 1) = SU(n)/SU(n - 1) = S' 2 " -1 . (9.65) 

1 Octonions are also known as Cayley numbers. The set of octonions is a vector space over R but 
not a field. The product is neither commutative nor associative. See John C Baez, The Octonions 
math. RA/0 105 155 for a recent review. 



9.4.2 Associated bundles 


Given a principal fibre bundle P(M, G), we may construct an associated fibre 
bundle as follows. Let G act on a manifold F on the left. Define an action of 
g e G on P x F by 

(u, f) (ug, g~ l f) (9.66) 

where u e P and / e F. Then the associated fibre bundle (E, jv, M, G, E. P) 
is an equivalence class P x F/G in which two points (u, f ) and ( ug , g~ l f) are 
identified. 

Let us consider the case in which F is a ^-dimensional vector space V . Let p 
be the A'-dimensional representation of G. The associated vector bundle P x p V 
is defined by identifying the points (u, v ) and (ug, p(g)~ l v ) of P x V, where 
u e P, g e G and v e V. For example, associated with P ( M . GL(k, IE)) is a 
vector bundle over M with fibre Er . The fibre bundle structure of an associated 
vector bundle E — P x p V is given as follows. The projection jte '■ E —> M is 
defined by tte(u, v) = n (u). This projection is well defined since tt(u) — n (ug) 
implies iCEiug , p(g)~ l v ) = n(ug) = jte(u, v). The local trivialization is given 
by ipi : Ui x V —>■ (Ui). The transition function of E is given by p(tp (p)) 

where tjj ( p ) is that of P. 

Conversely a vector bundle naturally induces a principal bundle associated 
with it. Let E — ^ M be a vector bundle with dim E = k (i.e. the fibre is 
W k or C k ). Then E induces a principal bundle P(E) = P(M, G) over M 
by employing the same transition functions. The structure group G is either 
GL(£, IK) or GL(A', C). Explicit construction of P(E) is carried out following 
the reconstruction process described in section 9.1. 

Example 9.11. Associated with a tangent bundle TM over an 777 -dimensional 
manifold M is a principal bundle called the frame bundle LM = UpeM L P M 
where L p M is the set of frames at p. We introduce coordinates x 1 ' on a chart Ui. 
The bundle T p M has a natural basis {d/dx^} on A frame u — {Xi, . . . , X m } 
at p is expressed as 


X a =X tl a d/dx tl \ p 1 <a<m (9.67) 

where (X^ a ) is an element GL(777 , ffi) so that {Xq.} are linearly independent. We 
define the local trivialization 0,- : (7/ x GL(???,IR) — >• n~ l (Uj) by <pE l (u) — 
(p, (X^a .)). The bundle structure of LM is defined as follows. 

(i) If u = {Xi, . . . , X,,,} is a frame at p, we define ttl : LM -> M by 
tx l (u) = p. 

(ii) The action of a — (a‘ j) e GL(m, IK) on the frame u = {Xj, . . . , X,„} is 
given by (u, a) i->- ua, where ua is a new frame at p, defined by 


Yp = X a a a p. 


(9.68) 



Conversely, given any frames {X u } and { Yp ) there exists an element of 
GL(w, E) such that (9.68) is satisfied. Thus, GL(m, E) acts on LM 
transitively. 

(iii) Let {/,• and U j be overlapping charts with the coordinates x IM and y^, 
respectively. For p e U, fl Uj , we have 

x a = X^vd/dx^p = X^ad/dy^lp (9.69) 

where (X^a), ( X „ ) e GL(m, E). Since X^ a — (dx fJ '/dy v ) p X fi a , we find 
the transition function f/; ( p) to be 

tfjip) = ((dx^/dy v ) p ) e GL(m , E). (9.70) 

Accordingly, given T M, we have constructed a frame bundle LM with the 
same transition functions. 

In general relativity, the right action corresponds to the local Lorentz 
transformation while the left action corresponds to the general coordinate 
transformation. It turns out that the frame bundle is the most natural framework in 
which to incorporate these transformations. If \X a ) is normalized by introducing 
a metric, the matrix (X^a) becomes the vierbein and the structure group reduces 
to O (m); see section 7.8. 

Example 9.12. A spinor field on M is a section of a spin bundle which we now 
define. Since GLIA', E) has no spinor representation, we need to introduce an 
orthonormal frame bundle whose structure group is SO(A ). As we mentioned in 
example 4. 12, SPIN(A') is the universal covering group of SO(A). [To define a spin 
bundle, we have to check whether the SO (k) bundle lifts to a SPIN(A') bundle over 
M. The obstruction to this lifting is discussed in section 1 1.6.] 

To be specific, let us consider a spin bundle associated with the four- 
dimensional Lorentz frame bundle LM , where M is a four-dimensional Lorentz 
manifold. We are interested in a frame with a definite spacetime orientation as 
well as a time orientation. The structure group is then reduced to 

0|(3, 1) = {A e 0(3, 1)| detA = +1, A 0 ° > 0}. (9.71) 

The universal covering group of Ot(3, 1) is SL(2, Q, see example 5.16(c). The 
homomorphism cp : SL(2, C) -> OlJ" (3, 1) is a 2 : 1 map with ker<p = {I 2 , —h}- 
The Weyl spinor is a section of the fibre bundle (VT, n , M. C 2 , SL(2, C)). The 
Dirac spinor is a section of 

(D, ;r , M, C 4 , SL(2, Q © SL(2, Q). (9.72) 

A section of W is a (1/2,0) representation of OlJ"(3, 1) and a section of 
(IT, jr, M, C 2 , SL(2, C)) is a (0, 1/2) representation, see Ramond (1989) for 
example. A Dirac spinor belongs to (1/2, 0) © (0, 1/2). 

The general structure of the spin bundle will be worked out in section 1 1.6. 



9.4.3 Triviality of bundles 


A fibre bundle is trivial if it is expressed as a direct product of the base space and 
the fibre. The following theorem gives the condition under which a fibre bundle 
is trivial. 

Theorem 9.2. A principal bundle is trivial if and only if it admits a global section. 

Proof. Let ( P , n, M, G ) be a principal bundle over M and let s e T(M, P) 
be a global section. This section may be used to show that there exists a 
homeomorphism between P and M x G. If a is an element of G, the product 
s(p)a belongs to the fibre at p. Since the right action is transitive and free, any 
element u e P is uniquely written as s(p)a for some p e M and a e G. Define 
a map O : P — > M x G by 


$ : s(p)a i->- ( p,a ). (9.73) 

It is easily verified that <J> is indeed a homeomorphism and we have shown that P 
is a trivial bundle M x G. 

Conversely, suppose F = MxG. Let <fi : M x G —> P be a trivialization. 
Take a fixed element g e G. Then s g : M — > P defined by s g (p) = cp(p , g) is a 
global section. □ 

Is there a corresponding theorem for vector bundles? We know that any 
vector bundle admits a global null section. Thus, we cannot simply replace P by 
E in theorem 9.2. Let us consider the associated principal bundle P(E) of E. 
By definition, E and P(E) share the same set of transition functions. Since the 
twisting of a bundle is described purely by the transition functions, we obtain the 
following corollary. 

Corollary 9.2. A vector bundle E is trivial if and only if its associated principal 
bundle P(E) admits a global section. 

Problems 

9.1 Let L be the real line bundle over .S' 1 (i.e. L is either the cylinder .S 1 x IP or 
the Mobius strip). Show that the Whitney sum L © L is a trivial bundle. Sketch 
L © L to confirm the result. 

9.2 Let Q n be the volume element of S n normalized as f s „ Q n — 1. Let 
/ : S 2n ~ l — »■ S n be a smooth map and consider the pullback f*^l n . 

(a) Show that f*Q n is closed and written as do>„_i, where &>„_ i is an (n — 1)- 
form on S 2 " -1 . 

(b) Show that the Hopf invariant 

H(f) = f 

Js 2 ' 





is independent of the choice of u>„-\. 

(c) Show that if / is homotopic to g, then H( f) — H(g). 

(d) Show that H(f) = 0 if n is odd. [Hint: Use w n -\ A d&)„_i = ^d(<a„_i A 

l )•] 

(e) Compute the Hopf invariant of the map n : .S' 3 -> .S' 2 defined in example 9.9. 
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CONNECTIONS ON FIBRE BUNDLES 


In chapter 7 we introduced connections in Riemannian manifolds which enable us 
to compare vectors in different tangent spaces. In the present chapter connections 
on fibre bundles are defined in an abstract though geometrical way. 

We first define a connection on a principal bundle. Our abstract definition 
is realized concretely by introducing the connection one-form whose local form 
is well known to physicists as a gauge potential. The Yang-Mills field strength 
is defined as the curvature associated with the connection. A connection on a 
principal bundle naturally defines a covariant derivative in the associated vector 
bundle. We reproduce the results obtained in chapter 7, applying our approach to 
tangent bundles. We conclude this chapter with a few applications of connections 
to physics: to gauge field theories and Berry’s phase. We follow the line of 
Choquet-Bruhat el al (1982), Kobayashi (1984) and Nomizu (1981). Details will 
be found in the classic books by Kobayashi and Nomizu (1963, 1969). See also 
Daniel and Viallet (1980) for a quick review. 

10.1 Connections on principal bundles 

There are several equivalent definitions of a connection on a principal bundle. 
Our approach is based on the separation of tangent space T U P into ‘vertical’ 
and ‘horizontal’ subspaces. Although this approach seems to be abstract, it is 
advantageous compared with other approaches in that it clarifies the geometrical 
pictures involved and is defined independently of special local trivializations. 
Connections are also defined as g-valued one-forms which satisfy certain axioms. 
These definitions are shown to be equivalent. 

We briefly summarize the basic facts on Lie groups and Lie algebras, since 
we shall make extensive use of these (see section 5.6 for details). Let G be a 
Lie group. The left action L g and the right action R g are defined by L g h = gh 
and R g h — hg for g,lr e G. L g induces a map L g * : 7/,(G) — »■ T g h(G). A 
left-invariant vector field X satisfies L g *X |/, = X\ g h- Left-invariant vector fields 
form a Lie algebra of G, denoted by q. Since X e g is specified by its value at the 
unit element e, and vice versa , there exists a vector space isomorphism g = T e G. 
The Lie algebra g is closed under the Lie bracket, [ 7„ . 7jg] = f a p Y T Y where { T a } 
is the set of generators of g. f a p Y are called the structure constants. The adjoint 
action ad : G G is defined by ad A , h = ghg~ l . The tangent map of ad g is 


called the adjoint map and is denoted by Ad,, : 7), (G) —> T ghg ~ 1 (G). If restricted 
to T e (G) — g, Ad ? maps g onto itself; Ad g : g -> g as A !->■ gAg , A e g. 

10.1.1 Definitions 

Let u be an element of a principal bundle P(M, G ) and let G p be the fibre at 
p = 7 t(u). The vertical subspace V U P is a subspace of T„P which is tangent to 
G p at u. [Warning: T U P is the tangent space of P and should not be confused 
with the tangent space T p M of M.] Let us see how V U P is constructed. Take an 
element A of g. By the right action 


^exp (tA) u — M exp(M) 

a curve through u is defined in P . Since ti(u ) = i r(u exp(rA)) = p, this curve 
lies within G p . Define a vector A # e T„P by 

A # /(«) = A/ (H exp(/A))| f=0 (10.1) 

dr 

where / : P — ► ffi is an arbitrary smooth function. The vector A # is tangent to 
7 at m, hence A # e V U P . In this way we define a vector A # at each point of P 
and construct a vector field A # , called the fundamental vector field generated 
by A. There is a vector space isomorphism $ : g — ► V„ P given by A h> A # . 
The horizontal subspace H t , P is a complement of V„ P in T„ P and is uniquely 
specified if a connection is defined in P. 

Exercise 10.1. 

(a) Show that jr*X = 0 for X e V U P. 

(b) Show that (t preserves the Lie algebra structure: 

[A # , B # ] = [A, B] # . (10.2) 

Definition 10.1. Let P( M . G) be a principal bundle. A connection on P is a 
unique separation of the tangent space T u P into the vertical subspace V„ P and 
the horizontal subspace H u P such that 

(i) T U P = H U P® V U P. 

(ii) A smooth vector field X on P is separated into smooth vector fields 
X H e H U P and X v e V U P as X = X H + X v . 

(iii) H U gP = R g *H u P for arbitrary u e P and g € G: see figure 10.1. 

The condition (iii) states that horizontal subspaces H U P and H llg P on 
the same fibre are related by a linear map R g * induced by the right action. 
Accordingly, a subspace H u P at u generates all the horizontal subspaces on the 
same fibre. This condition ensures that if a point u is parallel transported, so is its 
constant multiple ug, g e G; see later. At this point, the reader might feel rather 



p 


Figure 10.1. The horizontal subspace H ug P is obtained from H u P by the right action. 


uneasy about our definition of a connection. At first sight, this definition seems 
to have nothing to do with the gauge potential or the field strength. We clarify 
these points after we introduce the connection one-form on P . We again stress 
that our definition, which is based on the separation T u P — V u P © H u P , is purely 
geometrical and is defined independently of any extra information. Although the 
connection becomes more tractable in the following, the geometrical picture and 
its intrinsic nature are generally obscured. 

10.1.2 The connection one-form 

In practical computations, we need to separate T U P into V u P and H U P in a 
systematic way. This can be achieved by introducing a Lie-algebra-valued one- 
form co e p <8> T* P called the connection one-form. 


Definition 10.2. A connection one-form co e g®T*P is a projection of T u P onto 
the vertical component V U P ~ g. The projection property is summarized by the 
following requirements, 


(i) 

co(A # ) — A Aeg 

(10.3a) 

(ii) 

R*co — Ad ? -i co 

(10.3b) 

that is, for X e T„P, 



R* g co ug (X ) 

— 0>ug(R g *X) — g a>u(X)g. 

(10.3b') 

Define the horizontal subspace H u P by the kernel of a>, 


HuP 

= {XeT u P |©(X) = 0}. 

(10.4) 



To show that this definition is consistent with definition 10.1, we prove the 
following proposition. 


Proposition 10.1. The horizontal subspaces (10.4) satisfy 

Rg*H u P — H ug P. (10.5) 

Proof. Fix a point u e P and define H u P by ( 10.4). 'lake X e //„ P and construct 
R V ,.X <= T ug P. We find 


co(R g *X) = R*a>(X) = g~ 1 co(X)g = 0 

since co(X) = 0. Accordingly, R<,. t X e HugP ■ We note that /(„* is an invertible 
linear map. Hence, any vector Y <= H ug P is expressed as Y = R g *X for some 
X e H U P. This proves (10.5). □ 

We have shown that the definition of the connection one-form co is equivalent 
to that of the connection, since co separates T u P into //„ P © V u P in harmony with 
the axioms of definition 10.1. The connection one-form co defined here is known 
as the Ehresmann connection in the literature. 

10.1.3 The local connection form and gauge potential 

Let {£/;} be an open covering of M and let a ; be a local section defined on each 
Uj . It is convenient to introduce a Lie-algebra-valued one-form A, on U\ , by 

Ai=o*(o&Q®Q}{Ui). (10.6) 

Conversely, given a Lie-algebra- valued one-form At , on Uj, we can reconstruct a 
connection one-form w whose pullback by a * is A; . 

Theorem 10.1. Given a g-valued one-form A,- on (7, and a local section a,- : (/, -> 
7 r _1 ((/,-), there exists a connection one-form co such that A-, — o*a>. 


Proof. Let us define a g-valued one-form co on P by 

ooi = gf l Jt*Aigi + gf' d P gi (10.7) 

where dp is the exterior derivative on P and g t is the canonical local 
trivialization defined by ff 1 (u ) — ( p. g,) for u — ctj(p)gj. We first show 
that erf co i — A[. For X e T p M , we have 

cr*coi(X) = coi((Ji*X) = n*Ai((Ti*X) + d pgticr^X) 

— Ai(n*Oj*X) + d pgj(cJi*X) 




u, P X 


Figure 10.2. The canonical local trivialization defined by the local section a, over U[ . 


where we have noted that <7,*X e T aj P and g, — e at cr/, see figure 10.2. We 
further note that n *er,* = id t p (M) and d pgi{Oi*X) — 0 since g = e along 
Thus, we have obtained o*a>j(X) = At (X). 

Next we show that &>,• satisfies the axioms of a connection one-form given in 
definition 10.2. 

(i) Let X — A # e V„P,A e g. It follows from exercise 10.1(a) that 
7t*X = 0. Now we have 

VIA*) = gT ' Apgl (A*) = *(«)-' dg(H 7 (M)) 

dt t = o 

, x -i , , d exp(rA) 

= gi(u) gi(u ) —A. 

dt t = o 

(ii) Take X e T U P and h e G. We have 

P h ('g ( X) — 0)j ( R/i t . X ) — gi u f l 'A-j(7T*R-h*X')gi u f l T g nl / f dpgiuh ( Ph X ) . 
Since = gj u h and n*Rh*X = tt*X (note that nR/, = n), we have 

R* h coi(X) = h~ l gxj Ai(n*X)gj u h + h~ l gX l d P g iu (X)h 
= h~ l coj(X)h 

where we have noted that 

-l -l d 

Si u h d P§iuh (Rh*X) &iuh ^ Xiy(t)h 


h 8 iu 37S1XO h = h ' Shi dp giu (X)h. 




Here y (t) is a curve through u = y (0), whose tangent vector at u is X. 

Hence, the g-valued one-form a>j defined by ( 10.7) indeed satisfies Aj — 
cr*a>i and the axioms of a connection one-form. □ 

For (o to be defined uniquely on P, i.e. for the separation T u P — //„ P@V U P 
to be unique, we must have Wj — coj on Uj fl Uj. A unique one-form to is then 
defined throughout P by co\u i — To fulfil this condition, the local forms A / 

have to satisfy a peculiar transformation property similar to that of the Christoffel 
symbols. We first prove a technical lemma. 

Lemma 10.1. Let P(M , G) be a principal bundle and er, (oj) be a local section 
over Uj (Uj) such that Uj A Uj 0. For X e T p M (p e Uj A Uj), ct,*Z and 
o/*X satisfy 

<rj*X = R tlJ *M*X) + (tA l d tjj (X)) # (10.8) 

where tjj : Uj fl U j — > G is the transition function. 

Proof. Take a curve y : [0, 1] — > M such that y(0) = p and y(0) = X. Since 
csj(p) and Oj(p) are related by the transition function as crj(p) — Oj ( p)tjj(p) 
(see (9.43)), we have 

d d 

Oj*X= —aj(y(t)) = — {oj(t)tjj(t)} 

t= o t= o 

d d 

= —Oj (t) ■ tjj (p) + Oj (p) ■ —tjj (t) 

at at f q 

d 

= Rtjj*(^i*X) + oj (p)tjj (p) —tjj (t) 

at t = o 

where Oj ( t ) stands for Oj (y (t)) and we have assumed that G is a matrix group for 
which R v , t X = Xg. We note that 

-i -i d 

tjj(p) d tjj (X) = tjj(p) —tij(t) 

at t = 0 

= -rltij(pr'tij(t)] e T e (G) = g. 
at t= o 

[Note that tjj(p)~ l tjj(y(t)) = e at t = 0.] This shows that the second term of 

1 jj 

Oj*X represents the vector field (t ;/ . dtjj (X)) at Oj(p). □ 

The compatibility condition is easily obtained by applying the connection 
one-form tx> on ( 10.8). We find that 

Oj(o(X) = R*.co(oi*X) + tf. ] dtjj (Z) 

= tX l co(oj*X)tij + tX 1 d tjj (X) 



where the axioms of definition 10.2 have been used. Since this is true for any 
X e T p M , this equation reduces to 

Aj — tjj l Aitij + t- 1 d tjj . (10.9) 

This is the compatibility condition we have been seeking. 

Conversely, given an open covering {(/;}, the local sections {er,} and the 
local forms {Ai} which satisfy (10.9), we may construct the g-valued one-form a> 
over P. Since a non-trivial principal bundle does not admit a global section, the 
pullback Ai = o* to exists locally but not necessarily globally. In gauge theories, 
At is identified with the gauge potential (Yang-Mills potential). As we have 
seen in the monopole case, the monopole field B — gr/r 3 does not admit a 
single gauge potential and we require at least two Ai to describe this U(1 ) bundle 
over S 2 . 

Exercise 10.2. Let P(M, G) be a principal bundle over M and let U be a chart of 
M. Take local sections oq and oq over U such that 02 (p) — o\{p)g{p). Show 
that the corresponding local forms A \ and A 2 are related as 

A 2 =g~ l A\g + g~ l dg. (10.10a) 

In components, this becomes 

Alp = g~ l (p)Aip(p)g(p) + g~ l {p)dpg(p) (10.10b) 

which is simply the gauge transformation defined in section 1.8. 

Example 10.1. Let P beaU(l) bundle over M. Take overlapping charts (/,- and 
Uj. Let A/ ( Aj ) be a local connection form on Ui (U /). The transition function 
tjj : Ui n Uj U(l) is given by 

tij(p) = exp[iA(/?)] A {p) e ffi. (10.11) 

Ai and Aj are related as 

Aj(p) =tij (pr'Ai(p)tij ip) + tij (p ) _1 d tij {p) 

=Ai(p) + id A(p). (10.12a) 

In components, we have the familiar expression 

A jll =Ai ll + idpA. (10.12b) 

Our connection A t , differs from the standard vector potential A /t by the Lie 
algebra factor: Ap — i A^. 

Here we note again that co is defined globally over the bundle P( M . G). 
Although there are many connection one-forms on P(M, G), they share the same 
global information about the bundle. In contrast, an individual local piece (gauge 
potential) Aj is associated with the trivial bundle 7t~ l (Ui) and cannot have any 
global information on P. It is to or, equivalently, the total of { A , } satisfying 
the compatibility condition (10.9), which carries the global information about the 
bundle. 



10.1.4 Horizontal lift and parallel transport 

Parallel transport of a vector has been defined in chapter 7 as transport without 
change. Parallel transport of an element of a principal bundle along a curve in M 
is provided by the ‘horizontal lift’ of the curve. 

Definition 10.3. Let P(M , G) be a G bundle and let y : [0, 1] -» M be a curve 
in M. A curve y : [0, 1] — > P is said to be a horizontal lift of y if n o y — y 
and the tangent vector to y (t) always belongs to Hy {t) P . 

Let X be a tangent vector to y . Then it satisfies o> ( X ) = 0 by definition. 
This condition is an ordinary differential equation (ODE) and the fundamental 
theorem of ODEs guarantees the local existence and uniqueness of the horizontal 
lift. 


Theorem 10.2. Let y : [0, 1] M be a curve in M and let uq e 7t HyCO)). 
Then there exists a unique horizontal lift y (t) in P such that y (0) = uq. 

Let us construct such a curve y. Let [7/ be a chart which contains y and 
take a section a ; over tJ j . If there exists a horizontal lift y . it may be expressed 
as y(t) = Oi(y{t))gj(t), where gfit) stands for gi(y(t)) e G. Without loss of 
generality, we may take a section such that er;(y( 0)) = y (0), that is gi( 0) = e. 
Let X be a tangent vector to y(t) at y( 0). Then X = y*X is tangent to y at 
mo = y(0). Since the tangent vector X is horizontal, it satisfies a>(X) — 0. A 
slight modification of lemma 10.1 yields 

* = gi(tr l vi*Xgi(t) + [g ; (0 _1 d gi (X)] # . 


By applying u> on this equation, we find 

o = w(x) = giitr'coioi^giit) + gi{tr ^ 


Multiplying on the left by gj (f), we have 


d gj{t) 
dr 


co(cji*X)gi(t). 


(10.13a) 


The fundamental theorem of ODEs guarantees the existence and uniqueness of 
the solution of (10.13a). 

Since ®(cr,*X) = a*co(X) — Aj(X), (10.13a) is expressed in a local form 


as 


d giit) 
d t 


= - Ai(X) gi (t ) 


whose formal solution with gi (0) = e is 


(10.13b) 


where (P is a path-ordering operator along y(t). 1 The horizontal lift is expressed 
as y(t) = (Ji(y(t))gi(y(t)). 

Corollary 10.1. Let y' be another horizontal lift of y, such that y'( 0) = y(0)g. 
Then y'(t ) = y(t)g for all t e [0, 1], 


Proof. We first note that the horizontal subspace is right invariant, R g *H u P — 
H ug P . Let y be a horizontal lift of y . Then y g : t i-> y(t)g is also a horizontal 
lift of y{t) since its tangent vector belongs to Hy g P. From theorem 10.2 we find 
y' is the unique horizontal lift which starts at y (0)g. □ 


Example 10.2. Let us consider the bundle P(M. IK) = M x M where M — 
M 2 — {0}. Let f : ((x, y), f) ht k e P be a local trivialization, where (x, y) are 
the coordinates of M while f is that of the additive group IK. Let 


ydx — xdy 
x 2 + y 2 


+ df 


be a connection one-form. It is easily verified that co satisfies the axioms of 
the connection one-form. In fact, for A # = Ad/df, del being an element 
of the Lie algebra of additive group, we have &>(A # ) = A. Furthermore, 
R g:¥ a> = co = g -1 cog, since IK is Abelian. Let y : [0, 1] -* M be a 
curve t (cos27Tf, sin27Tt). Let us work out a horizontal lift which starts at 
((1,0), 0). Let 

d d.r 3 dy 3 df 3 

dr dr dx dr 3y dr 3/ 

be tangent to yit). For X to be horizontal, it must satisfy 


0 = co(X) = 


dx y 

dr r 2 


d /__ 9 d / 

dr r 2 dr _7r dr ■ 


The solution is easily found to be / = 2nt + constant. We finally find the 
horizontal lift y passing through ((1, 0), 0), 


y(r) = ((cos27rr, sin2^r), 2 nt) 


(10.15) 


which is a helix over the unit circle. 

Under the group action (right or left does not matter), / translates to 
f + g, g € IK. The shifted horizontal lift is 


y g (t) — ((cos27rr, sin27r r), 2nt + g). (10.16) 

1 Ajy and A iV (y(s ) ) do not commute in general and the exponential in (10.14) is not well 

defined as it is. Let A(t) and B(t) be /-dependent matrices. Then the action of IP is 


T[A(r)B(i)] = 


I A(t)B(s) 
|b0)A(0 


(/ > s) 
(s > t). 


Generalization to products of more matrices should be obvious. 




Figure 10.3. A curve y(t) in M and its horizontal lifts y(t) and y(t)g. 

Let y '■ [0, 1] — > M be a curve. Take a point u o e 7r _1 (y(0)). There is 

a unique horizontal lift y (t) of y (t) through mo, and hence a unique point u i = 

y( 1) e tt~ 1 (y ( I )), see figure 10.3. The point u\ is called the parallel transport 
of uq along the curve y. This defines a map T(y) : 7r -1 (y(0)) — >• 7r _1 (y(l)) 
such that «o u\. If the local form (10.14) is employed, we have 

_ / d x^(y(t)) \ 

mi =t7 / (l)0>exp^-y Ai lL ^ 10 - 17 ) 

Corollary 10.1 ensures that T(y) commutes with the right action /L, . First 
note that R g r(y)(uo) = u\g and T(y)l? ? (Mo) = r(y)(i<og). Observe that y(t)g 
is a horizontal lift through iiQg and it \ g . From the uniqueness of the horizontal 
lift through nog, we have mg = r(y)(uog), that is R g r(y)(uo) — r(y)R ? (uo). 
Since this is true for any u o e 7r _1 (y (0)), we have 

^gF(y) = r(y)f?g. (10.18) 

Exercise 10.3. Let y be a horizontal lift of y : [0, 1] — > M. Consider a map 
F(y _1 ) : 7T — 1 (y (1)) — > 7r _1 (y(0)) where y _1 (t) = y(l — t ). Show that 


r(y _1 ) = r(y) -1 . 


(10.19) 



Consider two curves a : [0, 1] —>■ M and p : [0, 1] — > M such that 
crfl) = yS (0). Define the product a * p by 

\a(2 1) 0 < t < \ 

a * P — < , 

| /j(2? — 1) \<t< 1. 

Let T(<5) : tt-'MO)) Tr-^afl)) and V{fi) : tt -1 (/3(0)) -» tt - 1 (/3(1)). 
Show that 

rca * p) = r (P) o r(a). ( 10 . 20 ) 

Exercise 10.4. Let us write u ~ v, if u, v e P are on the same horizontal lift. 
Show that ~ is an equivalence relation. 

10.2 Holonomy 
10.2.1 Definitions 

Let P(M, G) be a principal bundle and let y : [0, 1] — »• M be a curve whose 
horizontal lift through mo e 7r _1 (/(0)) is y. In the last section, we defined 
a map V (y) : jr _1 ()/(0)) -> tt — 1 (y (1)) which maps a point uq — y(0) to 
mi = y( 1). Let us consider two curves cr, p : [0, 1] — > M witha(0) = P(0) — po 
and « ( 1 ) = yS ( 1 ) = p\ . Take horizontal lifts a and p of a and p such that 
a(0) = P{ 0) = uq. Then a(l) is not necessarily equal to P( 1). This shows that if 
we consider a loop y : [0, 1] -> M at p — y(0) = y (1), we have y (0) ^ y (1 ) in 
general. A loop y defines a transformation r y : 7t~ 1 (p) — > 7t~ l {p) on the fibre. 
This transformation is compatible with the right action of the group, 

r y (ug) = tyOOg (10.21) 

which follows immediately from (10.18). We note that r y depends not only on 
the loop y but also on the connection. 

Example 10.3. Consider an M-bundle over M = K 2 — {0}. The connection 
one-form a> and the loop y in example 10.2 define a map r y : zr — 1 (( 1 , 0)) -* 
7T — 1 ((1, 0)) given by gh>g + 2jt, gel 

Take a point u e P with : r(u) = p and consider the set of loops C p (M ) at 
p\ Cp(M) = {y : [0, 1] — > M|y(0) = y( 1) = pj. The set of elements 

0 )( = {g e G | T y (m ) = ug, y e C p {M)} (10.22) 

is a subgroup of the structure group G and is called the holonomy group at u. The 
group property of <t>„ is easily derived from exercise 10.3. If a, p and y — a * p 
are loops at p, we have r y = tp o r a , hence 


X y {u) = Xpo X a (li) = Xp(ug a ) = Xp(u)g a = Ugpg a 



where r a (u) = ug a etc. This shows that 


gy = gpga- (10.23) 

The constant loop c : [0, 1] i-a- p defines the identity transformation 
x c : u i — y u . The inverse loop y 1 of y induces the inverse transformation 
Ty-i = ty 1 , hence g y -i = g~ l . 

Exercise 10.5. (a) Let r a (u) = ug a . Show that 

r a (ug) = ug(ad g g a ) = ug(g~ l g a g). (10.24) 

Verify that 

4> ua=a- l <S> u a. (10.25) 

(b) Let u, u' e P be points on the same horizontal lift y . Show that 

= 4 >«'. 

(c) Suppose that M is connected. Show that all <f>„ are isomorphic to each 
other. 

Exercise 10.6. Let .4, = .A, /; d.L' be a gauge potential over U, and y a loop in 
Uj. Let Ty{u) = ug y , u e P, gy e G. Use (10.14) to show that 

g y = fPexp ^ ^ A ilx (10.26) 

Let C°(M) denote the set of loops at p, which are homotopic to the constant 
loop at p. The group 

= {g e °\ = u 8’ V e C°p(M)} (10.27) 

is called the restricted holonomy group. 

10.3 Curvature 

10.3.1 Covariant derivatives in principal bundles 

We defined the exterior derivative d : Q r (M ) — »• fi r+l (M ) in chapter 5. An 
r-form i] is a real-valued form acting on vectors, 

r ] : TM A ... A TM -» ffi. 

We will generalize this operation so that we can differentiate a vector-valued r- 
form (f> e £i r (P) ® V, 

(p : TP A . . . A TP -* V 

where V is a vector space of dimension k. The most general form of (j> is 
4> = Y-!u=\ ^ ® e a- {e a } being a basis of V and <p a e f2' (P). 


A connection ®ona principal bundle P(M . G) separates T U P into H U P © 
V„P. Accordingly, a vector X e T U P is decomposed as X = X H + X ' where 
X H e H U P and X v e V U P. 

Definition 10.4. Let 4> e Y2'( P) ® V and X \, . . . , X r+ \ e T U P. The covariant 
derivative of </> is defined by 

D0 (X i , . . . , Xr+i) = d P </>(X? , . . . , xf +] ) (10.28) 

where dp </> = dp <j) a <g> e a . 

10.3.2 Curvature 

Definition 10.5. The curvature two-form f2 is the covariant derivative of the 
connection one-form a>, 

Q = Deo e Qr{P) <g> g. (10.29) 

Proposition 10.2. The curvature two-form satisfies (cf (10.3b)) 

R*£l = a~ l na a e G. (10.30) 

Proof. We first note that {R a *X) H — R a *(X H ) (/? fl * preserves the horizontal 
subspaces) and dp/?* = R* d p , see (5.75). By definition we find 

R*£2(X, Y) = Cl(R a *X, R a *Y) = d P co((R a *X) H , ( R a *Y ) H ) 

= dp&> (. R a *X H , R a *Y H ) = R* d pen (X H , Y h ) 

= d P R*m(X H . Y h ) 

= dp (a~ l a>a)(X H , Y H ) = a~ l dpa> (X H , Y H )a 
= a~ l Q(X, Y)a 

where we noted that a is a constant element and hence dpo = 0. □ 

Take a g-valued p-form f ® T a and a g-valued g-form q — if ® T a 

where f a e £l p {P), rf 1 e £2 9 ( P), and { 7^ } is a basis of g. Define the commutator 
of f and q by 

[C rj] = C A t] - (—l) pq q A if 

= TaTpS 01 A if - (~l) pq TpT a ri p A r 
= [Ta, Tp] ® r A if = f a p y Ty ® r A rf . (10.31) 

If we put f — q in (10.31), when p and q are odd, we have 
[?. ?] = 2(A( = f a p Y Ty ® r A t;f 
Lemma 10.2. Let X e H U P and Y e V U P. Then [X, Y] e H U P . 



Proof. Let F be a vector field generated by g(t), then 

C Y X = [Y, X] = lim r l (R s(t uX - X). 

Since a connection satisfies R g *H u P — H ug P, the vector R g ( t )*X is horizontal 
and so is [F, X], □ 

Theorem 10.3. Let X .Y <= T U P. Then Q and o> satisfy Cartan’s structure 
equation 

n(X, Y) =d pco (X, Y) + [co{X), a>(F)] (10.32a) 

which is also written as 

£2 = dpffl + wAw. (10.32b) 

Proof. We consider the following three cases separately: 

(i) Let X, Y e H„P. Then co ( X ) — co(Y ) = 0 by definition. From definition 
10.5, we have £2(X, Y ) = d pa)(X H , Y H ) = dpa>(X, Y ), since X — X H and 
= Y H . 

(ii) Let X e H„P and Y e V„P. Since Y H — 0, we have Q( X, Y ) = 0. We 
also have a> (X) = 0. Thus, we need to prove dpa> (X, Y) = 0. From (5.70), we 
obtain 

d pm (X, Y) = Xw{Y) - Yco(X) - w([X, T]) = Xco(Y) - co([X, F]). 

Since Y e V u P , there is an element V eg such that Y — V # . Then o>(Y) = V is 
constant, hence Xa)(Y) — X ■ V — 0. From lemma 10.2, we have [X , F] e H U P 
so that co([X, F]) = 0 and we find d pco ( X , F) = 0. 

(iii) For X, Y e V u P , we have Q (X, F) = 0. We find that, in this case, 

d pco (X, Y) = Xco{Y) - Yco (X) - co{[X , F]) = -<w([X, F]). 

We note that X and F are closed under the Lie bracket, [X, F] e V„P, see exercise 
10.1(b). Then there exists A eg such that 

co([X, F]) = A 

where A # = [Z, F], Let B # — X and C # = F. Then [ a>(X ), <y(F)] = [£, C] = 
A since [B, C] # = [B # , C # ]. Thus, we have shown that 

0 = d pco (X, F) + co([X, F]) = dpto (X, Y) + [«(Z), co(F)]. 

Since L! is linear and skew symmetric, these three cases are sufficient to show 
that (10.32) is true for any vectors. 

To derive (10.32b) from (10.32a), we note that 

[ft), ft)](X, F) = [T a , Tf)]co a A ft/(X, F) 

= [T a , T p ][co a (X)w fi (Y) - a>P (X)(o a (Y)] 

= [co(X), <u(F)] - [a>(F), ft>(Z)] = 2[w(X), ft>(F)]. 

Flence, f2(Z, F) = (dpcn + j[cd, co])(X, F) = (d pco + co A co)(X, F). □ 



10.3.3 Geometrical meaning of the curvature and the Ambrose-Singer 
theorem 

We have shown in chapter 7 that the Riemann curvature tensor expresses the non- 
commutativity of the parallel transport of vectors. There is a similar interpretation 
of curvature on principal bundles. We first show that C ( A , Y ) yields the vertical 
component of the Lie bracket [A, Y] of horizontal vectors A, Y e H U P. Itfollows 
from co(X) = u>(Y) = 0 that 

d pco (A, Y) = Xco(Y) - Yco(X) - a>([A, T]) = -a>([A, Y]). 

Since X H = X, Y H = Y , we have 


Q(A, Y) = d pco (X, Y) = -®([A, Y]). (10.33) 

Let us consider a coordinate system {x^} on a chart U . Let V — d/dx 1 and 
W = d/dx 2 . Take an infinitesimal parallelogram y whose corners are O = 
{0, 0, . . . , 0}, P = {e, 0, . . . , 0}, Q = {e,5,0,..., 0} and R = {0, 5, 0, ... , 0}. 
Consider the horizontal lift y of y. Let X, Y e H U P such that tr*A = e V and 
jr*L = 8W. Then 


ir*dX, Y] h ) = eS[V, W] = e8 


"9 9 ' 

9X 1 ’ 9x 2 


= 0 


(10.34) 


that is [A, Y] is vertical. This consideration shows that the horizontal lift y of 
a loop y fails to close. This failure is proportional to the vertical vector [X, Y] 
connecting the initial point and the final point on the same fibre. The curvature 
measures this distance, 


Q (A, Y) = — &>([A, Y]) = A (10.35) 

where A is an element of g such that [A, Y] = A # . 

Since the discrepancy between the initial and final points of the horizontal 
lift of a closed curve is simply the holonomy, we expect that the holonomy group 
is expressed in terms of the curvature. 

Theorem 10.4. (Ambrose-Singer theorem) Let P(M, G) be a G bundle over a 
connected manifold M. The Lie algebra f) of the holonomy group <t>„ 0 of a point 
mo e P agrees with the subalgebra of g spanned by the elements of the form 

S2 U (A , F) A ,YeH u P (10.36) 

where a e P is a point on the same horizontal lift as uq. [See Choquet-Bruhat et 
al (1982) for the proof.] 


10.3.4 Local form of the curvature 


The local form £F of the curvature Q is defined by 

J=a*Q. (10.37) 

where it is a local section defined on a chart U of M (cf A = cr*co). 9" is expressed 
in terms of the gauge potential A as 

3 = dA + AAA (10.38a) 

where d is the exterior derivative on M. The action of T on the vectors of TM is 
given by 

T(X, Y) = dA (X, Y) + [.A(X), A{Y)]. (10.38b) 

To prove (10.38a) we note that A — cr*co, a* d pa> = d a*u> and ct*(£ A rj) — 
cr*t; A <7 * i] . From Cartan’s structure equation, we find 

S' = a*(dpa> + co A co) — d cr*a> + a*ca A (j*a> — dA + A A A. 

Next, we find the component expression of S' on a chart U whose coordinates 
are r ,J = cp(p). Let A — A^ dx 1 ' be the gauge potential. If we write 
S = ^S /X i, dx IJ A dx v , a direct computation yields 

S 'fiv = d^Av — d v A^ + [. A p, Av\. (10.39) 

S is also called the curvature two-form and is identified with the (Yang-Mills) 
field strength. To avoid confusion, we call Q the curvature and S the (Yang- 
Mills) field strength. Since A fl and ,T /X l; are g-valued functions, they can be 
expanded in terms of the basis \ T a ) of g as 

A fl = A tl a T a S /tv = F flv a T a . (10.40) 

The basis vectors satisfy the usual commutation relations [7’„, Tp] — f a p v T y . We 
then obtain the well-known expression 

= d„A v a - d v A„ a + fft y a A/A/. (10.41) 

Theorem 10.5. Let (/, and U j be overlapping charts of M and let S) and S/ be 
field strengths on the respective charts. On (7/ fl Uj , they satisfy the compatibility 
condition, 

•J; = Ad -i T, = tT% tij 

ij J 

where tjj is the transition function on Ut fl Uj . 


(10.42) 



Proof. Introduce the corresponding gauge potentials Aj and Aj, 


IT j — dAj A Aj A Aj A j — dA j A A j A Aj . 

Substituting Aj — t7 l Ajtjj + tf . 1 dtjj into Aj, we verify that 

•fj = d (tjj -A-itjj A tjj d tjj) 

A (tjj 1 Ajtj/ A tjj 1 d tjj) A ( tjj 1 Ajtjj A tjj 1 d tjj) 

= [ tjj d tjj A tjj Ajtjj A tjj dAj tjj 

- A dt U - hj ' dt v hj 1 A d 'i/] 

+ [tjj ^ Aj A Ajtjj A tjj l Aj A d tjj 
A tjj dr jj tjj A Ajtjj A tjj d tjj A tjj dtjj\ 

— tjj \dAj A Aj A Aj)tjj — tjj ^ Ajtjj 

where use has been made of the identity dr _1 =— r _1 drr _1 . □ 

Exercise 10. 7. The gauge potential A is called a pure gauge if A is written locally 
as A — g _1 dg. Show that the field strength A vanishes for a pure gauge A. [It 
can be shown that the converse is also true. If IT = 0 on a chart U, the gauge 
potential may be expressed locally as A — g~ l dg.] 

10.3.5 The Bianchi identity 

Since o> and Q are g-valued, we expand them in terms of the basis { T a } of g as 
co — co a T a , £2 = £2 a T a . Then (10.32b) becomes 

Q a = dpco 01 A fpy 01 ^ Aft/. (10.43) 

Exterior differentiation of (10.43) yields 

dpf2“ = ff5y a dpco^ A cxf A ffty a u>^Adpco Y . (10.44) 

If we note that oXX ) — 0 for a horizontal vector X, we find 

D£2(X, Y, Z ) = d P £2 ( X H , Y h , Z H ) = 0 

where X,Y,Ze T U P . Thus, we have proved the Bianchi identity 

D£2 = 0. (10.45) 

Let us find the local form of the Bianchi identity. Operating with er* on 
(10.44), we find that ct* d P Q, = d • o*Q. = dT for the LHS and 

o*(dpco A co — co A dpcu) = d o*a> A o*co — cr*&> A der*a> 

= dA A A — A A d/1 — A A A — A aA 



for the RHS. Thus, we have obtained that 


T>3' = d3' + AA3'-3AA = d3+[A, 3“] = 0 (10.46) 

where the action of V on a g- valued p-form i] on M is defined by 

Vrj = d ?7 + [A, ?;]. (10.47) 

Note that — dfFfor G = U(l). 

10.4 The covariant derivative on associated vector bundles 

A connection one-form wona principal bundle P(M, G) enables us to define the 
covariant derivative in associated bundles of P in a natural way. 

10.4.1 The covariant derivative on associated bundles 

In physics, we often need to differentiate sections of a vector bundle which is 
associated with a certain principal bundle. For example, a charged scalar field in 
QED is regarded as a section of a complex line bundle associated with a U(l) 
bundle P(M, U( 1 )). Differentiating sections covariantly is very important in 
constructing gauge-invariant actions. 

Let P{M, G) be a G bundle with the projection 7tp. Let us take a chart (7/ of 
M and a section cr; over ( 7 , . We take the canonical trivialization </>,■ ( p. e) — 07 ( p). 
Let y be a horizontal lift of a curve y : [0, 1] — > Up We denote y( 0) = po 
and y(0) — uq. Associated with P is a vector bundle E = P x p V with the 
projection tte, see section 9.4. Let X e T p M be a tangent vector to y(t) at po. 
Let s e F(M, E) be a section, or a vector field, on M. Write an element of E as 
[(«, v)] = {( ug , p(g) - 1 v|n e P , v e V , g e G}. Taking a representative of the 
equivalence class amounts to fixing the gauge. We choose the following form, 

*(/>) = [(^ (/>),!(/>))] (10.48) 


as a representative. 

Now we define the parallel transport of a vector in E along a curve y in M. 
Of course, a naive guess is parallel transported if §(/(?)) is constant along y(t)’ 
does not make sense since this statement depends on the choice of the section 
a i ( p). We define a vector to be parallel transported if it is constant with respect to 
a horizontal lift y of y in P. In other words, a section s(y(t)) = [(y(t), ij(y (t)))] 
is parallel transported if i] is constant along y(t). This definition is intrinsic since 
if y'(t) is another horizontal lift of y, then it can be written as y'(t) = y(t)a, 
a e G and we have (we omit p to simplify the notation) 

[(.m, nm = [( y'{t)a-\ nit))] = [(y'co, a“V))] 

where p(t) stands for >;(y(?)). Hence, if pit) is constant along y(t), so is its 
constant multiple a~ l rj(t). 



Now the definition of covariant derivative is in order. Let s(p) be a section 
of E. Along a curve y : [0, 1] — > M we have s(t ) = [ (y (f ) , 77 (f))], where y(t) is 
an arbitrary horizontal lift of y(t). The covariant derivative of s(t) along y{t) at 
po — y( 0 ) is defined by 


Vxs 


y(0), 

at 



(10.49) 


where X is the tangent vector to y(t) at po. For the covariant derivative to be 
really intrinsic, it should not depend on the extra information, that is the special 
horizontal lift. Let y'(t) — y(t)a (a e G) be another horizontal lift of y. If y' (t) 
is chosen to be the horizontal lift, we have a representative [(y'(f), £7 _ 1 77 (f))]. 
The covariant derivative is now given by 


y'(0), —{a '77(f)} 
at 


7=0/ J 


y'(0)a 


-1 


’ dr 


77 (r) 


7=0/ J 


which agrees with (10.49). Hence, Vxs depends only on the tangent vector X 
and the sections s e T(M, E ) and not on the horizontal lift y(t). Our definition 
depends only on a curve y and a connection and not on local trivializations. The 
local form of the covariant derivative is useful in practical computations and will 
be given later. 

So far we have defined the covariant derivative at a point po — y(0). It 
is clear that if X is a vector field, Vx maps a section s to a new section V x s, 
hence Vx is regarded as a map T(M, E) — » T(M, E). To be more precise, take 
X e X(M ) whose value at p is X p e T p M. There is a curve y(t) such that 
y{0) — p and its tangent at p is X p . Then any horizontal lift y(t) of y enables 
us to compute the covariant derivative Vx.v | ; , = Vx ; ,.v. We also define a map 
V : T(M, E) T(M, E) ® ^'(M) by 

Vs(X) = V x s X e X(M) s e T(M, E). (10.50) 


Exercise 10.8. Show that 


Vx(fllSl + G2S2 ) = Cll Vx«l + ^2 Vx^2 

(10.51a) 

V(fliXi + Q2S2) = fll V.S1 + £72 VS2 

(10.51b) 

V(a l X 1 +a 2 X 2 )S = £71 Vx^ + £72 ^X 2 S 

(10.51c) 

Vx(/s) = X[f]s + fV x s 

( 1 0.5 1 d) 

X(fs) = (d f)s + / Vs 

(10.5 le) 

V/xs = /Vxs 

( 10.5 If) 


where a,- e M, s, s' e T(M, E) and / e H(M). 



10.4.2 A local expression for the covariant derivative 

In practical computations it is convenient to have a local coordinate representation 
of the covariant derivative. Let P(M, G) be a G bundle and E — P x p G be an 
associate vector bundle. Take a local section er; e T (£/,-, P) and employ the 
canonical trivialization rr, ( p) = 4>i(p, e). Let y : [0, 1 ] -> M be a curve in Uj 
and y its horizontal lift, which is written as 


y(t) = a i (t)g i (t) (10.52) 

where gj(t) = giiyit)) e G. Take a section e a (p) = [(cr,(p), e a °)] of E, where 
Cq, 0 is the crth basis vector of V; ( e a ° )& — (8 a )P . We have 

e a (f) = [() Ht)gi(tr\e a 0 )] = [(y(f), «i(0~ V)L (10.53) 

Note that g,(t)~ l acts on e a ° to compensate for the change of basis along y . The 
covariant derivative of e a is then given by 


Vxe„ = 


y(0), V) 

at 


1 = 0 / J 


y(0), -gi(t) 


-i 


gi(t)\ gi(t) l e a 0 


1=0/ J 


= [(y(O)gi(O)-', Ai(X)e a °)] (10.54) 

where (10.13b) has been used. From (10.54) we find the local expression, 


Vxe„ = [(<7,(0 ),A i (X)e a {> )]. 


(10.55) 


Let A, = Ain d^^ = Ai^pdx^ where Ai^ R = A,-^ (T y ) a R . The second 


entry of (10.55) is 


[1 ft — 'THlL \ l Y) 


A dx^ A O dx^ O A 

Ai(X)e a ° = —e p 0 Ai/ y 8 a y = — . Aj a e p °. 


Substituting this into (10.55), we finally have 

VxCa = 


dX^ a A 

o', (0), —Ain P a ep° 


dX^ a 
= ~fr- A W a e P 


Vcq. = Aj P a ep. 

In particular, for a coordinate curve x^, we have 

Vg/a^^ea = AifY a ep. 


(10.56a) 

(10.56b) 

(10.57) 


It is remarkable that a connection A on a principal bundle P completely specifies 
the covariant derivative on an associated bundle E (modulo representations). 



Exercise 10.9. Lets(p) = [(er, (p), £, (/;))] = %i a (p)e a be a general section of E, 
where l;i(p) = %i a (p)e a ° . Use the results of exercise 10.8 to verify that 





1 Aj (X)^j 




\ 3 Hi 01 

r=o/J 

d t 

[ dx v 


+ Ajfj. 


a 



e a . 

(10.58) 


By construction, the covariant derivative is independent of the local 
trivialization. This is also observed from the local form of V^s. Let <r, ( p) and 
<7j(p ) be local sections on overlapping charts Uj and U j. On U, fl U j, we have 
a j ( p) — <jj (p)tij ( p). In the i -trivialization, the covariant derivative is 




cr, (0), ^- + Ai(X)%i 

at 


t= o. 


' hj > c | f (tijHj) + Ai(X)tjj%j 

a j (0) ’ ~xr + Aj(X)^j 

at 0=0 


r=0/ J 


(10.59) 


where use has been made of the condition (10.9). The last line of (10.59) is Vy.v 
expressed in the j -trivialization. 

We have found that the covariant derivative defined by (10.49) is independent 
of the horizontal lift as well as the local section. The gauge potential A\ 
transforms under the change of local trivialization so that V^.v is a well-defined 
section of E. In this sense, Vy is the most natural derivative on an associated 
vector bundle, which is compatible with the connection on the principal bundle 
P. 


Example 10.4. Let us recover the results obtained in section 7.2. Let FM be a 
frame bundle over M and let I'M be its associated bundle. We note FM — 
P(M , GL(m, K)) and T M — FM x p W n , where m — dimM and p is the m x m 
matrix representation of GL(w, E). Elements of g[(m, K) are m x m matrices. 
Let us rewrite the local connection form A\ as T 01 dx ,x . We then find that 

V 3/3 ^e a = [(cr/ (0) , r M e„ 0 )] = T P^ep (10.60) 

which should be compared with (7.14). For a general section (vector field), 
s(p) =[(cr/(p), Xj(p))] = Xi a (p)e a , we find 

V a /ax*s = (10.61) 

which reproduces the result of section 7.2. It is evident that the roles played by the 
indices a, fl and p in T“ are very different in their characters; p is the Q}{M) 
index while a and /J are the gl(m, ffi) indices. 



Example 10.5. Let us consider the U(l) gauge field coupled to a complex scalar 
field 4>. The relevant fibre bundles are the U(l) bundle I’ ( M , U (I ) ) and the 
associated bundle E — P x p C where p is the natural identification of an element 
of U(l) with a complex number. The local expression for co is Ai = Aj /t dx ,L , 
where A[ jL — Ajld/dx 11 ) is the vector potential of Maxwell’s theory. Let y be 
a curve in M with tangent vector X at y (0) . Take a local section er,- and express 
a horizontal lift y of y as y(t) = er,- If 1 e C is taken to be the basis 

vector, the basis section is 

e = [(er;(p), 111- 

Let <j)(p ) = [(er,(p), <t>(p))] = < &(p)e (<t> : M -> C) be a section of E, which 
is identified with a complex scalar field. With respect to y (t), the section is given 
by 

4>(t) = <Ht)[(y(t),U(ty 1 )] (10.62) 

where U(t ) = q 1( cO) The covariant derivative of </> along y is 

Vx</> = ^[(/(0), U (0) -1 )] + O(0)[(y(0), U(0)~ l Ai(X) ■ 1)] 
at 

/do dx^\ u ( ao \ 

= (l063) 

Example 10.6. Let us consider the SU(2) Yang-Mills theory on M. The relevant 
bundles are the SU(2) bundle P(M, SU(2)) and its associated bundle E = 
P x p C 2 , where we have taken the two-dimensional representation. The gauge 
potential on a chart (/; is 

Ai = Aip dx^ = Aip a (^) dx" (10.64) 

where a a / 2i are generators of SU(2), o a being the Pauli matrices. Let e a ° 
(a = 1, 2) be basis vectors of C 2 and consider sections 

*«(/>) = [fa (P).*« 0 )] (10-65) 

where a, ( p ) defines a canonical trivialization of P over (/, . Let <p(p) — 
[(<7j(p), 0 a (p)e a 0 )] be a section of E over M. Along a horizontal lift y(t) = 
(Ti(p)U (t), U ( t ) e SU(2), we have 

<Kt) = [(y(0, U(tr l <t> a (t)e a 0 )]. (10.66) 


The covariant derivative of (j> along X = d/d t is 

X x <P = 


, d<t>“(0) n 
y(0), U( 0) _1 — —e, 0 


d t 


+ [(K(0), U(0)- l Ai(X) a ^( 0) e «°)] 
/ 3 $“ 




d x n + A A°'p® l} I e <* 


= x ^ 1 


(10.67) 



where (10.13b) has been used to obtain the last equality. 

Exercise 10.10. Let us consider an associated adjoint bundle E s = P x ,\d g where 
the action of G on g is the adjoint action V -> Ad„ V — g~ l Vg, Leg and 
g e G. Take a local section a, e T (C/,- , P) such that y(t) — Oi(t)g{l). Take a 
section s{p) — [(cr, •(/?), V(p))] on E g , where V (p) = V a (p)T a , {Ta} being the 
basis of g. Define the covariant derivative T>xs by 

V x s= (j?(0),^{Ad g(() -iV(f)} J ■ (10.68a) 

Show that 

V x s= ( CT ‘'(° ),^p- + [A(X),V(t)] ) 

( dV a \ 

— + fpy a A/vv\ [ (cr , (0) , T a )]. (10.68b) 

10.4.3 Curvature rederived 

The covariant derivative V,\'.v defines an operator V : T(M, E) -> T(M, E ® 
Q l ( M )) by (10.50). More generally, the action of V on a vector-valued //-form 
s ® i), i) e £2 P (M), is defined by 

V(s ® rf) = (Vi) A /; + s <S> d/ 7 . (10.69) 

Let Ui be a chart of M and cr,- a section of P over Ui . We take the canonical local 
trivialization over Ui . We now prove 

VVe a = e p ® (10.70) 

where e a = [(cr,-, e a 0 )] e T (Ui, E). In fact, by straightforward computation, we 
find 

VVe ff = V {ep ®Ai\) = Vep AA^ a + ep ® dAi\ 

= ep® (dA,^ + Ai P Y A Ai y a ) =ep® 

Exercise 10.11. Let s(p) = % a (p)e a (p) be a section of E. Show that 

VVi = e a <®S)%^. (10.71) 

10.4.4 A connection which preserves the inner product 

Let E — %• M be a vector bundle with a positive-definite symmetric inner product 
whose action is defined at each point p e M by 

g p : 7 r~ l (p) ® 7r ~ l (p) EL 


(10.72) 



Then g is said to define a Riemannian structure on £. A connection V is called 
a metric connection if it preserves the inner product, 

d [g(,y, *')] = g(Vs , s') + g(s, Vs'). (10.73) 

In particular, if we take s = e a , s' = ep and set g(e a , ep) — g a p, we find 

d gap — a gyP + pgcty ■ (10.74) 

This should be compared with (7.30b). If E = TM and, moreover, the torsion- 
free condition is imposed, our connection reduces to the Levi-Civita connection 
of the Riemannian geometry. 

Given an inner product, we may take an orthonormal frame [e a } such that 
g ( e u . ep) = S a p. The structure group G is taken to be O(k), k being the dimension 
of the fibre. The Lie algebra oik) is a vector space of skew symmetric matrices 
and the connection one-form a> satisfies 

(o a p = ~0J P a . (10.75) 

Theorem 10.6. Let £ be a vector bundle with inner product g and let V be the 
covariant derivative associated with the orthonormal frame. Then V is a metric 
connection. 

Proof. Since g is bilinear, it suffices to show that 

d[g(i, s')] = g(Vs, s') + g(s , V/) 

for s — fe a and s' = f'ep where /, f e T(M). In fact, the LHS is 
d [g(fe a , f'ep)} = d [ff'S a p] = d (ff')S a p while the RHS is 

giVfea, f'ep) + gif e a , V f'ep) 

= g (d/ e a + fe y co y a , f'ep) + gife a , d f ep + fe y co y p) 

= d //' Sap + ff'co Y a 8 y p + fdf s a p + ff'm y pK y 
= d iff) Sap 

where (10.75) has been used to obtain the final equality. □ 

10.4.5 Holomorphic vector bundles and Hermitian inner products 

Definition 10.6. Let E and M be complex manifolds and jt : E M a 
holomorphic surjection. The manifold £ is a holomorphic vector bundle if the 
following axioms are fulfilled. 

(i) The typical fibre is C k and the structure group is GL(k, C). 

(ii) The local trivialization <p ,• : Ui x C k -> 7r _1 ({/,) is a biholomorphism. 



(iii) The transition function tij : V, C\Uj —*■ G — GL(k, C) is a holomorphic 
map. 

For example, let M be a complex manifold with dim- M = m. The 
holomorphic tangent bundle TM + = [J psM T p M + is a holomorphic vector 
bundle. The typical fibre is C" and the local basis is {3/3 z^}. 

Let h be an inner product on a holomorphic vector bundle whose action at 
p e M is h p : Tt~ 1 (p) x n~ 1 (p) —> C. The most natural inner product is a 
Hermitian structure which satisfies: 

(i) li p (u,a v + bw ) = ah p (u, v) + bh p (u, w), for u, v, w e jr~ 1 (p), a, b e C; 

(ii) h p (u, v) — h p (v, u), u,ve it~ l (p); 

(iii) h p (u , u) > 0, h p (u, u) = 0 if and only if u = <f>i(p, 0); and 

(iv) h(s\,S 2 ) e 3 ! '(M) C forsi,S2 e F(M, E). 

A set of sections {ei, ...,%} is a unitary frame if 

h(ei,ej) = 8ij. (10.76) 

The unitary frame bundle LM is not a holomorphic vector bundle since the 
structure group U (m) is not a complex manifold. 

Given a Hermitian structure h, we define a connection which is compatible 
with h. The Hermitian connection V is a linear map T(M, E) -» T(M, E <g> 
T*M C ) which satisfies: 

(i) V(/s) = (d/)i + /Vs, / e T(M) C , s e T(M, £); 

(ii) d [h(s\, S 2 )] = h(Vs i, S 2 ) + A(si, VS 2 ); and 

(iii) according to the destination, we separate the action of V as Vs = Ds + D.v, 
D.v (Ds) being a (1, 0)-form ((0, l)-form) valued section. We demand that 
D = 9. 


It can be shown that given E and a Hermitian metric h, there exists a unique 
Hermitian connection V. The curvature is defined from the Hermitian connection. 
Let {e\ , . . . , ek} be a unitary frame and define the local connection form A^ a by 



(10.77) 

The field strength is defined by 


5F = dA *T A A A. 

(10.78) 

We verify that 


VVc„ = V{epA p 0l ) = ep? p a . 

(10.79) 


We prove that both A and T are skew Hermitian: 


A p a +A a p = h(Ve a , ep) + h(e a , Vep) = dh (e a , ep) = d S a p = 0 
3^ „ + 3 a p = d A^a + A^y A A Y a + d A a p + A a y A AY a 

= d (A^a - A p a ) + A\ a A Y a + A Y a A A a y = 0. 



Thus, we have shown that 

A a fS = -Af> a = -3V (10.80) 

Next we show that S' is a (1, 1 )-form. Let {<?„ } be a unitary frame. S cannot 
have a component of bidegree-(0, 2) since 

ep&a = VV3« = (D + 9)(D + d)e a = DDe a + (D9 + 3D)e a . 

It follows from = ~3~ a p that S' has no component of bidegree-(0, 2), and, 
hence, S has no component of bidegree-(2, 0) either. Thus is a two-form of 
bidegree-(l, 1). 

10.5 Gauge theories 

As we have remarked several times, a gauge potential can be regarded as a local 
expression for a connection in a principal bundle. The Yang-Mills field strength is 
then identified with the local form of the curvature associated with the connection. 
We summarize here the relevant aspects of gauge theories from the geometrical 
viewpoint. 

10.5.1 U(l) gauge theory 

Maxwell’s theory of electromagnetism is described by the U(l) gauge group. U(l) 
is Abelian and one dimensional, hence we omit all the group indices a, /?, . . . 
and put the structure constants f a p v — 0. Suppose the base space M is a four- 
dimensional Minkowski spacetime. From corollary 9.1, we find that the U(l) 
bundle P is trivial, namely P = IR 4 x U(l) and a single local trivialization over 
M is required. The gauge potential is simply 

A = Afi dx'L (10.81) 

Our gauge potential A differs from the usual vector potential A by the Lie algebra 
factor i: A jL = >A /t . The field strength is 

T=d A. (10.82a) 

In components, we have 

= dA v /dx » - dAfj/dx v . (10.82b) 

T satisfies the Bianchi identity, 

dT =3“ A A — A A T = 0. (10.83a) 

This should be expected from the outset since £T is exact, T = dA; and hence 
closed, d.A = d 2 A = 0. In components, we have 


3 X&HV + dv^X/j, + d^vx = 0. 


(10.83b) 



If we identify the components 'F jlv = i F jlv with the electric field £ and the 
magnetic field B as 


E, = Ft o, Bi = \e ijk Fj k (i, j, *=1,2, 3) (10.84) 

(10.83b) reduces to two of Maxwell’s equations, 

dB 

V x £ H =0 V £ = 0. (10.83c) 

3 1 

These equations are geometrical rather than dynamical. To find the dynamics, we 
have to specify the action. The Maxwell action 5m [.A] is a functional of A and 
is given by 

5m[-A] =\ f 3>T /ty d 4 x = -i [ F„ V F^ d 4 x. (10.85a) 

J K 4 3M 4 

Exercise 10.12. (a) Let *3^ = ^‘A KX e KklJLV be the dual of S^y. Show that 

5m [A] = -j f S' A *37 (10.85b) 

3M 4 

(b) Use (10.84) to show that 

-jF^F^ = ^(E 2 -B 2 ). (10.86) 

Show also that 

F/jlv * F^ — B ■ E. (10.87) 

By the variation of 5 m [A] with respect to we obtain the equation of 
motion. 


d^ v = 0. (10.88a) 

We find this equation is reduced to the second set of Maxwell’s equations (in the 
vacuum): 

3 E 

V • £ = 0 V x £ =0. (10.88b) 

3 1 


10.5.2 The Dirac magnetic monopole 

We have studied Maxwell’s theory of electromagnetism defined on IR 4 . The 
triviality of the base space makes the U(l) bundle trivial. Poincare’s lemma 
ensures that the field strength S' is globally exact: S' = cLA. It is interesting to 
extend our analysis to U(l) bundles over a non-trivial base space. We assume 
everything is independent of time for simplicity. 



The Dirac monopole is defined in K 3 with the origin O removed. K 3 — {0} 
and S 2 are of the same homotopy type and the relevant bundle is a U(l) bundle 
P(S 2 , U(l)). S 2 is covered by two charts 

Un = {(0, cp)\0 < 0 < + e} Us = {(0, (j))\\n — e < 9 < jt} 

where 0 and (p are polar coordinates. Let co be an Ehresmann connection on P. 
Take a local section on (cts) on J/n (Us) and define the local gauge potentials 

.An = As = or,?a>. 

We take .An and .As to be of the Wu-Yang form (section 1.9), 

A N = ig(l - cos6)d(p As = — ig(l + cos0) dcp (10.89) 

where g is the strength of the monopole. 

Let ?ns be the transition function defined on the equator Un (T Us- ?ns 
defines a map from S l (equator) to U(l) (structure group), which is classified 
by 7Ti (U ( 1)) = Z, see example 9.7. Let us write 

*NsW>) = exp[ip(0)] {(fi : S 1 ffi). (10.90) 

The gauge potentials An and As are related on Ujs fl Us by 

An = ? N s A-sAss + t NS ' dtNS = As + id^o. (10.91) 

For the gauge potentials (10.89), we find 

d <p = -i(AN - As) = 2 g dip. 

While (p runs from 0 to 2tt around the equator, <fi((p) takes the range 

A <p = J dcp = J 2gd(p = Ang. (10.92) 

For fNS to be defined uniquely, A<p must be a multiple of 2 it , 

A(p/2n = 2g e Z (10.93) 


which is the quantization condition of the magnetic monopole. The integer 2 g 
represents the homotopy class to which this bundle belongs. This number is also 
obtained by considering Fn = dAN and Fs = dAs ( .T n = i /'n etc). The total 
flux is 


cf> — 



(10.94) 


Thus, the curvature, that is the pair of the field strengths dAN and dAs, 
characterizes the twisting of the bundle. We discuss this further in chapter 1 1 . 


s 
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Figure 10.4. The Aharonov-Bohm experiment. B = 0 outside the solenoid. 


10.5.3 The Aharonov-Bohm effect 

In the elementary study of electromagnetism, the electric and magnetic fields (that 
is F/j V ) are of central interest. The vector potential A and the scalar potential 
4> — Ao are considered to be of secondary importance. In quantum mechanics, 
however, there are a variety of situations in which F jlv are not sufficient to 
describe the phenomena and the use of A t , = (A, Ao) is essential. One of these 
examples is the Aharonov-Bohm effect. 

The Aharonov-Bohm (AB) experiment is schematically described in figure 
10.4. A beam of electrons with charge e is incoming from the far left and forms 
an interference pattern on the screen C. A solenoid of infinite length is placed in 
the middle of the beam. A shield S prevents electrons from penetrating into the 
solenoid. Accordingly, the electrons do not feel the magnetic field at all. What 
about the gauge field A^ ? 

For simplicity, we make the radius of the solenoid infinitesimally small, 
keeping the total flux <t> = f s B ■ dS fixed. It is easy to verify that 

/ yep ,y<1> \ 

Afr)^-^,— ,°) A 0 = 0 (10.95) 

satisfies /(V x A) • dS = <t> and V x A = 0 for r 0. The vector potential 
does not vanish outside the solenoid. Classically, the solenoid cannot have any 
influence on electrons since the Lorentz force e(v x B) vanishes on the path of 
the beam. 



In quantum mechanics, the Hamiltonian H of this system is 


n = - 


1 

2m 



+ V{r) 


(10.96) 


where V (r) represents the effect of the experimental apparatus. Semiclassically, 
we can distinguish between the paths y\ and y\i in figure 10.4. We write the 
wavefunction corresponding to y\ (y\\) as i/q (i/qi) when A — 0. If A ^ 0, the 
wavefunction is given by the gauge-transformed form, 

= exp ^ie J A(r') ■ dr'^ i/q(r) (i =1, II) (10.97) 


where P is a reference point far from the apparatus. Let us consider a 
superposition i// [ /t + i/qf of wavefunctions i j/f and i/qf such that \[r^ ( P) = 
i/^jf ( P). Its amplitude at a point Q on the screen is 


fi(Q) + lAn (G) = exp (ie J A(r') ■ dr^i/a(6) 
+ exp ^ ie J A(r’) ■ dr ; 


1m (Q) 


= exp ^ie J A ■ dr' j exp yie (j) 


exp ( i e(p A- dr' )\fa(Q) + fn (Q) 

(10.98) 


where y = yi — ya- It is evident that even though B = 0 at the points in 
space through which the electrons travel, the wavefunction depends on the vector 
potential A. From Stokes’ theorem, we find that 

(f A- dr' = I (V x A) ■ dS = / B ■ dS = <t> (10.99) 

Jy Js JS 

where S' is a surface bounded by y . From this and ( 10.98), we find the interference 
pattern should be the same for two values of the fluxes <t> a and <J>b if 

e(<t> a — <I>b) = 27 rn n e Z. (10.100) 


What is the geometry underlying the Aharonov-Bohm effect? Since the 
problem is essentially two dimensional, we consider a region M — K 2 — { 0 }, 
where the solenoid is assumed to be at the origin. The relevant bundles are the 
principal bundle P(M, U(l)) and its associated bundle E = P x p C, where U(l) 
acts on C in an obvious way. The bundle £ is a complex line bundle over M, 
whose section is a wavefunction \j/ . 

Let us define a Lie-algebra-valued one-form A = iA = iAj, d.L'. The 
covariant derivative associated with this local connection is V = d + A, where 


A is given by (10.95). Since dA = S' = 0, this connection is locally flat. Let 
us consider the unit circle S ’ 1 which encloses the solenoid at the origin. We 
parametrize .S ’ 1 as c'" (0 < 0 < 2k) and write the connection on S 1 as 

A — i— d0. (10.101) 

2k 

This is obtained from (10.95) by putting r = 1. We require that the wavefunction 
i/r be parallel transported along .S ’ 1 with respect to this local connection, namely 

Vijf{0) = (d + i-^-d6^jil/(6) = 0. (10.102) 

The solution of (10.102) is easily found to be 

f(6) = e“ i<t>0/2jr . (10.103) 

Taking this section \jj amounts to neglecting the velocity of the electrons. The 
holonomy T : k~ 1 (6 — 0) -> 7 r -1 (0 = 2k) — k~ 1 (9 = 0) is found to be 

T : VK 0) i — s e“ l 4 > iA(0). (10.104) 

In an experiment, a toroidal permalloy (20% Fe and 80% Ni) has been used 
to eliminate the edge effects (Tonomura et al 1983). The dimensions of the 
permalloy are several microns and it is coated with gold to prevent electrons from 
penetrating into the magnetic field. 

10.5.4 Yang-Mills theory 

Let us consider SU(2) gauge theory defined on M 4 . The bundle which describes 
this gauge theory is P(K 4 , SU(2)). Since IR 4 is contractible, there is just a single 
gauge potential 

A = A^ a T a dx^ (10.105) 

where T a = o a /2i generate the algebra su(2), 

[Tc 7jg] = €c,p Y Ty. 

The field strength is 

J=dA + AAA = 5 dx 11 A d.r y (10.106a) 

where 

3> = d„A v - d v A tl + [A„,A V ] = F„ v a T a (10.106b) 

F iiv = dfiAvoc “I" €a/3y A-npAvy . (10.106c) 

The Bianchi identity is 


PT=d T+[yi, T] = 0. 


(10.107) 



The Yang-Mills action is 


< 5 ym[-A] = ~\ f WV tv ) = ?f tr(TA*T). 
J M ~ Jm 

The variation with respect to A M yields 


V^ 11 = 0 or V * T = 0. 


(10.108) 


(10.109) 


10.5.5 Instantons 

A path integral is well defined only on a space with a Euclidean metric. To 
evaluate this integral, it is important to find the local minima of the Euclidean 
action and compute the quantum fluctuations around them. Let us consider the 
SU(2) gauge theory on a four-dimensional Euclidean space ffi 4 . The local minima 
of this theory are known as instantons (or pseudoparticles, Belavin et al (1975)), 
see section 1.10. It is easy to verify that the Euclidean action is 

S%m[A] = l [ tr(T /xu T ,il ') = -I [ tr(T A *T) (10.110) 

Jm ~ Jm 

where the Hodge * is taken with respect to the Euclidean metric. As has been 
shown in section 1.10 the field strength corresponding to instantons is self-dual 
(anti-self-dual), 

V = ±*V- (10.111) 

The action of a self-dual (anti-self-dual) field configuration is 

= tr(T a *T) = =f ?/ tr(TAT). (10.112) 

“ Jm “ Jm 

Let us consider the topological properties of an instanton. We require that 

At(x) ->• g(x)~ 1 d ll g(x) as |*| — »■ L (10.113) 

for the action to be finite, where L is an arbitrary positive number. Since \x\ = L 
is the sphere S 3 , (10.113) defines a map g : S 3 SU(2) which is classified 
by tt 3 (SU(2)) = Z. How is this reflected upon the transition function? We 
compactify TO 4 by adding the infinity. We suppose the South Pole of ,5 4 represents 
the points at infinity and the North Pole the origin. Under this compactification, 
we separate M 4 into two pieces and identify them with the southern hemisphere 
Us and the northern hemisphere U\) of S 4 as 

U N ={x G M 4 ||jc| < L + e} (10.114a) 

U s ={x eR 4 \\x\> L - e] (10.114b) 

see figure 10.5. We assume there is no ‘twist’ of the gauge potential on Us and 
choose 


As (a) = 0 x e Us- 


(10.115) 





Figure 10.5. One-point compactification of R 4 to S' 4 . 


Then all the topological information about the bundle is contained in .An(x) or 
the transition function JnsM on the ‘equator’ S 3 (=(/n (T Us). Since .As = 0, we 
have, for x e {/n H Us, 


■An = f N sAsf N s + f N s d?NS = t N s d?NS- ( 10 . 116 ) 

Thus, g(x) in (10.113) is identified with the transition function ty,s(x) and 
classifying the maps g : S 3 — > SU(2) amounts to classifying the transition 
functions according to 7T3(SU(2)) = Z; see example 9.11. 

We now compute the degree of a map g : S 3 -» SU(2) following Coleman 
(1979). First note that SU(2) ~ S 3 since 

t A I 2 + foi e SU(2) ±>t 2 + (r 4 ) 2 = 1. 

Thus, maps g : S 3 — > SU(2) are classified according to 7T3(SU(2)) = TT^iS 3 ) = 
Z. We easily find the following. 

(a) The constant map 


S0 :xeS 3 ^eeSU(2) (10.117a) 

belongs to the class 0 (i.e. no winding) of 7T3(SU(2)). 

(b) The identity map (this is, in fact, the identity map S 3 — > .S' 3 ) 

gi : x i-> ~[x A h + x l (Ti], r 2 = x 2 + (x 4 ) 2 (10.1 17b) 

r 

defines the class 1 of 7T3(SU(2)). The explicit form of the gauge potential 
corresponding to this homotopy class is given in section 1.10. 

(c) The map 

g„ = {gi) n : XC r~' l [x 4 I 2 + x‘(Ti] n (10.117c) 

defines the class n of jt3(SU(2)). 


We recall that the strength (charge) of a magnetic monopole is given by the 
integral of the field strength T = d.A over the sphere S 2 . We expect that a similar 




relation exists for the instanton number. Since instantons are defined over .S' 4 , we 
have to find a four-form to be integrated over S 4 . A natural four-form is T A 3\ In 
the following, we shall omit the exterior product symbol when this does not cause 
confusion (T 2 stands for IF A 3). Observe that trSF 2 is closed, 

d tr T 2 = tr[dTT+TdT] 

= tr{— [A, S']? - $[A, T]} = 0 (10.118) 

where use has been made of the Bianchi identity dfF + [A, 9“] = 0. [ Remarks : In 
the present case, (10.1 18) seems to be trivial since any four-form on .S’ 4 is closed. 
Note, however, that (10. 1 1 8) remains true even on higher-dimensional manifolds.] 
By Poincare’s lemma, the closed form trT 2 is locally exact, 

trT 2 = d K (10.119) 

where A" is a local three-form. Thus, tr T 2 is an element of the de Rham 
cohomology group // 4 (.S -4 ). Later tr T 2 is identified with the second Chern 
character and K its Chern-Simons form, see chapter 1 1 . 

Lemma 10.3. The three-form K in (10.1 19) is given by 

K = tr[Adyi+ fyi 3 ]. (10.120) 

Proof. A straightforward computation yields 

d K = tr[(AA) 2 + ]{AAA 2 -AAAA+A 2 AA)] 

= lr[(3 - A 2 )(3 - A 2 ) 

+ f {(T - A 2 )A 2 - A{3 - A 2 )A + A 2 (3 7 - A 2 )}] 

= tr[T 2 - A 2 ? - 3\A 2 + A 4 + § (9 A 2 - A3 7 A + A 2 ? - A 4 )] 
where use has been made of the identity d A = T — A 2 . Now we note that 
trA 4 = 0 ixAS'A = — trA 2 3 7 = — trS'A 2 . 

For example, we have 

tr/LL/l = ^ tiA^xnAv dx K A d.r* A d.r ^ A d.r y 

= — 2 tiA v A K 3'xfi Ax v A Ax K A dr* A Ax^ = — tr/l 2 3 r 

where the cyclicity of the trace and the anti-commutativity of d.r 11 have been used. 
Then AK becomes 

AK = tr[T 2 - A 2 3' - 3\A 2 + j{JA 2 + ±(37l 2 + A 2 ?) + A 2 T}] 

= trT 2 


as has been claimed. 


□ 


Lemma 10.4. Let A be the gauge potential of an instanton. Then it follows that 




trA 3 . 


( 10 . 121 ) 


Proof. From Stokes’ theorem, we find that 




K 


where Us is defined by ( 10 . 1 14 ) and S 3 = dUs- Since S' = 0 on S 3 , we obtain 
k = tv[AAA + |yi 3 ] = tr[yi(S - a 2 ) + fyi 3 ] = -± tryi 3 
on S 3 , from which we find that 


[ trT 2 = i 

f trT 2 = — i / 

r , 

trA 3 

Ju N J 

s 4 J 

s 3 


where we have added tr (F 2 = 0 since .As =0. □ 


Note that tr S 2 is invariant under the gauge transformation, 
trS 2 ->tr[g“ 1 S 2 g] = trS 2 . 


Thus, it is reasonable to assume that trS 2 indeed contains a certain amount 
of topological information about the bundle, which is independent of particular 
connections. Let us consider the gauge fields (10.1 17a— c) given before. We find: 


(a) For go(x) = e, we have A — 0 on S 3 . Since the bundle is trivial we may 
take A = 0 throughout .S' 4 . Then S = 0, hence 



( 10 . 122 ) 


Note that this relation is true for any gauge potential which is obtained from 
A — 0 by smooth gauge transformations, that is for any gauge potential of 
the form A(x) — g(x) _1 dg (x), x e S’ 4 . 

(b) Next consider a gauge potential whose value on S 3 is given by (10. 1 17b) as 



— Lx k (ik) d 



+ i x l oi) 


(10.123) 


A considerable simplification is achieved if we note that the integrand tr./l 3 
should not depend on the point on S 3 at which it is evaluated since gi maps 
S 3 onto SU(2) A .S 3 in a uniform way. So we may evaluate it at the North 
Pole (x 4 = l.x = 0 ) of the unit sphere. We then find A — i dv 4 and 


tr.4 3 = i 3 tr[crj(j jcrk\ dx ! A d-W A dx k 

= 2sijk dx' A dx- 7 A dx k — 12 dx 1 A dx 2 A dx 3 . (10.124) 



Next we note that (x x ,x , x 3 ) is a good coordinate system on each 
hemisphere of S 3 and a> = dx 1 A dx 2 A dx 3 is a volume element at the 
North Pole. We find 

/ trA 3 =12 / co= 12(2tt 2 ) = 24tt 2 

Js 3 is 3 

where 2jt 2 is the area of the unit sphere .S' 3 . We finally obtain 

(10125) 

(c) Next we consider the map g n : .S 3 — > SU(2) given by (10.117c). We 
show that g 2 = gig i has a winding number 2. We divide .S 3 into the 
northern hemisphere and the southern hemisphere . Given a map 

g\ : S 3 -> SU(2), it is always possible to transform g\ smoothly to giN 

( 3 ) 

which has the winding number one and £in(*) = e for x e U s . All the 

( 3 ) 

variation takes place on [W . Similarly, g i may be deformed to gis with the 
same winding number and gis(x) = e for x e 6+ . Under this deformation, 
g 2 becomes 


82 (x) -> g' 2 (x) 


I SinO) x eU x 
gtsC*) 


X eU, 


( 3 ) 

N 

( 3 ) 


For A(x) — g' 2 (x) 1 dg 2 (x) (x e S’ 3 ), we have 

dc L = dc (/„,» dsmf + 4, ,r< ^ 1 dsis|3 


= 1 + 1 = 2 . 

Repeating the same procedure we find for A(x) — g ~ 1 dg„ that 


__L f tr + 2 = _L f 

8?r 2 J 24^2 


trA 3 = n. 


Is 4 24jt 2 4 3 

Collecting these results we establish the following theorem. 

Theorem 10.7. The degree of mapping g : S 3 -a SU(2) is given by 

,2 

/ 1J 
tr 


n = dcL' a ^' ds>> = 


if 

2 Js 4 


cry 

27T / 


(10.126) 


(10.127) 


(10.128) 


10.6 Berry’s phase 

In quantum mechanics, we define a wavefunction up to the phase. In most 
cases, the phase is neglected as an irrelevant factor. Berry (1984) pointed out 
that if the system undergoes an adiabatic change, the phase may have observable 
consequences. 



10.6.1 Derivation of Berry ’s phase 


Let H(R) be a Hamiltonian which depends on some parameters collectively 
written as R. Suppose R changes adiabatically as a function of time, R = R(t). 
The Schrodinger equation is 

(10.129) 

at 

We assume the system at t — 0 is in the nth eigenstate, |i//-(0)} = | n, R( 0)) where 
H(R( ;0))|n, *(0)> = E„(Rm\n, R( 0)>. (10.130) 


What about the state |i jr{t)) at later time r > 0? We assume the system is always 
in the nth state, i.e. no level crossing takes place (adiabatic assumption). 


Exercise 10.13. A naive guess of |i jr{t)) is 


\f(t)) = exp 




i / d s E n (R(s)) 


I n.R(t)) 


(10.131) 


where the normalized state \n, R(t)) satisfies 


H(R(t))\n, R(t)) = E„(R(t))\n , R(t)}. (10.132) 


Show that (10.131) is not a solution of (10.129). 


Since (10.131) does not satisfy the Schrodinger equation, we have to try 
other possibilities. Let us introduce an extra-phase rj n (t) in the wavefunction: 


I fit)) = exp 


'L 


i? 7 (r) — i / E n {R(s))ds 


I n.R(t)). 


(10.133) 


Inserting (10.133) into the Schrodinger equation (10.129), we find 
H(R(t)M(t)) = E n (R(t)M(t)) 


for the LHS (see (10.132)) and 


i-H = 

at 


d T]n ( 1 ) 


d t 


E n (R(t )) 


exp 


i r} n (t) ~ i 


'f En 


mt)) 

(R(s))ds 


i-\n,R(t)) 

at 


for the RHS. Equating these, it is found that rj n (t ) satisifes 



By integrating (10.134), we obtain 


r d 

rin(t) = i (n, R(s)\ — \n, R(s))ds 

Jo 

r R(t) 

= i / (n, R\V r\h, R)dR 

Jri 0 ) 


(10.135) 


where V r stands for the gradient in R-space. Note that r] n (t) is real since 


2Re(n, R(s ) | — | n, R(s)} 
d.? 

= (n, R(s)\^-\n, R(s)) + ( -f <n, R(s ) | I | n, R(s)) 


d.v V d.s 


= — (n, R(s)\n, R(s)) = 0. 
d.v 


Suppose the system executes a closed loop in //-space; /NO) = R(T ) for some 
T > 0. We then have 


rin(T) 


f T d 

= i {n,R(s)\ — \n,R(s))ds 

Jo d^ 

pR(T) 

= i / («, R\ Vr\h, R)dR. 

Jr( 0) 


(10.136) 


Since R(T) — /NO), the last expression seems to vanish. However, the integrand 
is not necessarily a total derivative and r/ n (T ) may fail to vanish. The phase >] n (T) 
is called Berry’s phase (Berry 1984). 

It was Simon (1983) who first recognized the deep geometrical meaning 
underlying Berry’s phase. He observed that the origin of Berry’s phase is 
attributed to the holonomy in the parameter space. We shall work out this point 
of view following Berry (1984), Simon (1983), Aitchison (1987) and Zumino 
(1987). 


10.6.2 Berry’s phase, Berry’s connection and Berry’s curvature 

Let M be a manifold describing the parameter space and let R = (/?!,...,/?*) 
be the local coordinate. At each point R of M , we consider the normalized nth 
eigenstate of the Hamiltonian H(R). Since a quantum state | n; R) cannot be 
distinguished from e 1 ^ |n; R), a physical state is expressed by an equivalence class 

[|*>]=={*|*)lseU(l)} (10.137) 

where we omit the index n since we are interested only in the nth eigenvector 
(figure 10.6). At each point R of M, we have a U(l) degree of freedom and we 
have a U(l) bundle P(M, U(l)) over the parameter space M. The projection is 
given by n(g\R)) = R. 


g\R) 

i»> 


n 



Figure 10.6. The fibre of a quantum mechanical system which depends on adiabatic 
parameters R. 


Fixing the phase of \ R) at each point R e M amounts to choosing a section. 
Let a(R) = \R) be a local section over a chart U of M. The canonical local 
trivialization is given by 

R)) = (R,e). (10.138) 

The ‘right’ action yields 

4>~ 1 (.\R)-g) = (R,e)g=(R,g). (10.139) 

Now that the bundle structure is defined, we provide it with a connection. 
Let us define Berry’s connection by 

A = A lt d R^ 1 = (R\(d | R)) = ~(d{R\)\R) (10.140) 

where d = (9/9 R 1 ' is the exterior derivative in R- space. Note that A is 
anti-Hermitian since 

0 = d((*|*» = (d(*|)|fi) + (R\d\R) = (R\d\R)* + (R\d\R). 

To see (10.140) is indeed a local form of a connection, we have to check the 
compatibility condition. Let Uj and U j be overlapping charts of M and let 
<7i(R) = |/?)j and <jj(R) = \R)j be the respective local sections. They are 
related by the transition function as \R) / = | R)jtij(R). We then find that 

Aj(R) - j(R\d\R)j = tij (R)~ x i{R\[d\R) itjj (R) + \R)d tij (R)] 

= Ai(R) + tijiRy'dtijiR). (10.141) 

The set of one-forms {.A;} satisfying (10.141) defines an Ehresmann connection 
on P{M, U(l)). 



The field strength S' of A is called Berry’s curvature and is given by 

? = dA = (d(*|) a (d| R)) = (jfr) dR " A dR "- d°- 142 ) 

After an example from atomic physics, we shall clarify how this geometrical 
structure is reflected in Berry’s phase. 

Example 10.7. Let us consider a quantum mechanical system which contains 
‘fast’ degrees of freedom r and ‘slow’ degrees of freedom R. For example, we 
may imagine an electron moving under the potential of slowly vibrating ions. 
Suppose the Hamiltonian is given by 

H =^- + TU + y (r\ R ) (10.143) 

2m 2 M 

where p(P) is the momentum canonical conjugate to r(R). As a first 
approximation, we may consider the slow degrees of freedom are ‘frozen’ at some 
value R and consider an instantaneous sub-Hamiltonian 

p 2 

h(R) = — + V(r\R) (10.144) 

2m 

and the eigenvalue problem 

h(R)\R) = e„(R)\R) (10.145) 

where | R) stands for the nth eigenvector \n: R) of the ‘fast’ degrees of freedom. 
We assume that the eigenvalue is isolated and non-degenerate. Berry’s connection 
is A(/?) = (/?|d|/f), while the curvature is 9“ = (d(/?|) A (d| /?)). 

It is interesting to see how the fast degrees of freedom affect the slow degrees 
of freedom. We assume the total wavefunction is written in the form 


'I'(r; R) = $>(R)\R) (10.146) 

and find the ‘effective’ Schrodinger equation which <f> ( /? ) , the wavefunction 
of the ‘slow’ degrees of freedom, satisfies. The eigenvalue problem of the 
Hamiltonian (10.143) is 

HV(r; R) = - ^[V| <WI*> + 2 V«c p(R) . V K | R) + ®(R)V 2 r \R)] 

_ d>(/?) J-V r 2 | R) + 4>(R)V(r; R)\R) 

2m 

= E n (R)<$(R)\R). 

If we multiply (R\ on the left and use the Schrodinger equation (10.145), this 
equation becomes 

+ 2VrcD(/?) . (R\V r \R) + 4>(/?)((/?|Vk |/?)) 2 ] 

+ e n (R)<t>(R) = E n {R)<S>(R) 


(10.147) 



where we have employed the Born-Oppenheimer approximation, in which all the 
matrix elements except the diagonal ones are neglected, 

<n; fl|V R |n'; R) = 0 n'/«- (10.148) 

Now the effective Hamiltonian for 1 4>(/?)) is given by 

! / 9 \ 2 

H eS (n) = -—\— I +A ll (R)\ + e„(R) (10.149) 

where A t , is a component of Berry’s connection, 

^L #t («) = (*|^ r |*). (10.150) 

It is remarkable that the fast degrees of freedom have induced a vector potential 
coupled to the slow degrees of freedom. Note also that the eigenvalue e„(R) 
behaves as a potential energy in This ‘spontaneous creation’ of the gauge 
symmetry reflects the phase degree of freedom of the wavefunction R). 

The Schrodinger equation describing the adiabatic change is 

H(R(t))\R(t),t) =i— I R(t),t) (10.151a) 

dr 

where we note that \R(t), t) has an explicit f-dependence as well as an implicit 
one through R(t). Berry assumes that 

\R(t),t) =exp(^-ij E n (t) dr^e ii?<f, |/?(r)> (10.152a) 

where | R) is an instantaneous normalized eigenstate of H(R), 

H(R)\R) = E„(R)\R) (R\R) = 1. (10.153) 

The first exponential is the ordinary dynamical phase while the second one is 
Berry’s phase. It is convenient for our purpose to define an operator 

n(R) = H(R) -E n (R) (10.154) 

to dispose of the dynamical phase. The state \R) is the zero-energy eigenstate of 
'H(R): 'H(R)\R) — 0. The solution of the modified Schrodinger equation, 

WR)\R(t), t) = i— \R(t), t) (10.151b) 

dr 


is then given by 


l*(r),r> = e i ^ ) |/?(r)>. 


(10.152b) 



We found in (10.136) that rj is given by 


f d Rt 1 8 C R(t) 

= i ds—(R(s)\ — \R(s))=i (R\d\R). (10.155) 

Jo d.s dR> L J R{ o) 

We show that Berry’s phase is a holonomy associated with the connection 
(10.140) on P(M, U(l)). Take a section <r(R) = |/?> over a chart U of M. Let 
R : [0, 1] —*■ M be a loop in U . 2 We write a horizontal lift of R{t) with respect 
to the connection (10.140) as 

R{t)=a(R(t))g(R{f)) (10.156) 


where g(R( 0)) is taken to be the unit element of U(l). The group element g(t) 
satisfies (10.13b), 

dg(t) , / d \ d 

= -A l -j = ~{R(t)\ — \R(t)) (10.157) 


where g(t) stands for g(R(t)). From g(t) = exp(i?;(r)), we obtain 


d n(f) d 

= -(R(t)\-m)) 
dr dr 


which is easily integrated to yield 


r?(l) 




(R(s)\ — \R(s))ds=i<P(R\d\R). 


(10.158) 


Let us note that R( 0) = /?(1), hence |/?(0)) = |/?(1)}. Then exp[i?;(l)] is 
regarded as a holonomy (figure 10.7) 


/?(!) = exp 


[R\d\R] 


l*(0)>. 


(10.159a) 


Exercise 10.14. Let S be a surface in M, which is bounded by the loop R(t). 
Show that 

R(l) = exp(-£$\-\R(0)) (10.159b) 

where T is given by (10.142). 

Example 10.8. Let us consider a spin- J particle in a magnetic field with the 
Hamiltonian 


Kl -«f 2 )' <10160) 


2 


We shall be a little sloppy in our notation. 



Figure 10.7. If the parameter changes adiabatically along a loop R (f), the state with initial 
condition \R (0)) becomes |/J(1)) which is different from \R (0)) in general. The difference 
is the holonomy and is identified with Berry's phase. 


The parameter R corresponds to the applied magnetic field. This is a two-level 
system taking eigenvalues ±|/?|. Let us consider the eigenvalue R = +|/?|. 
According to the prescription just described, we introduce a Hamiltonian H(R) = 
H(R)-\R\ and consider the zero-energy eigenstate of 'H(R) given by 

I*)n = [2 R(R + R3)]~ l/2 ( ^ + ^ \ . (10.161) 

The gauge potential is obtained after a straightforward but tedious calculation as 

R~> dWi — R\ d R~> 

yL N = N (j?|d|/?) N = -i - , ' ■ (10.162) 

ZK(K + K 3 ) 


The field strength is 

1 R\ d/?2 A d/?3 + Rn d/?3 A df?j + /?3 df?i A d/?T 

T = dA= 2- 2 1 £. (10.163) 

2 R 3 

So far we have assumed that the state | R) is isolated. However, this 
assumption breaks down if R = 0, in which case two eigenstates are degenerate. 
Surprisingly, this singularity behaves like a magnetic monopole in /f-space. To 
see this, we introduce polar coordinates 0 and c/j in R -space, 


Ri — R sin0 cos cp R 2 — R sin 6 sin (/> Rt, = Rcos9. 



The state (10.161) is expressed as 


cos(0/2) 
sin (0/2) 


l*>N 



(10.164) 


This state is singular at 6 = it, reflecting that \R)s is not defined for R 3 = —R. 
Consider another eigenvector 


|*)s = e" i *|«) N = 


e 


-^cos(0/2) \ 
sin (0/2) ) 


= [2 R(R- R 3 )]~ l/2 


Ri — i^2 \ 
r-r 3 ) 


(10.165) 


with the same eigenvalue. This eigenvector is singular at 0 = 0, that is at 
R 3 = R. Corresponding to these vectors, we have Berry’s gauge potentials in 
polar coordinates, 


An =ji(l — cos0) d(p 0 ^ 7t 

(10.166a) 

As — — ji(l + cos0) d(/> 0^0. 

(10.166b) 

They are related by the gauge transformation, 

A s = An — id </> = A N + e i0 de“^ 

(10.167) 


where g(jt/2, <p) — exp(— i (j>) is identified with the transition function /ns- 
Equation (10.166) is simply the vector potential of the Wu-Yang monopole of 
strength —j, see sections 1.9 and 10.5. The total flux of the monopole is 
= 4 t r(— I) = —2jt. 

The analogy between the present problem and the magnetic monopole is 
evident by now. If we fix the amplitude R of the magnetic field, the restricted 
parameter space is S 2 . At each point R of .S' 2 , the state has a phase degree 
of freedom. Thus, we are dealing with a U(l) bundle P(S 2 , U(l)), which also 
describes a magnetic monopole. For each choice of the parameters R , we have 
a fibre corresponding to the «th eigenstate n : R). The fibre at R consists of the 
equivalence class [|/?)] defined by (10.137). The projection jt maps a state to 
the parameter on which it is defined: tx : e'“ | /?) -> R e S 2 . As we have seen, 
this bundle is non-trivial since it cannot be described by a single connection. The 
non-triviality of the bundle implies the existence of a monopole at the origin. Note 
that R — 0 (that is, B = 0) is a singular point at which all the eigenvalues are 
degenerate. 

Next we turn to the problem of holonomy. Take a standard point R( 0) 
on S 2 and choose a vector | /? (0)) . We choose a loop R(t) on S 2 and execute 
a parallel transportation of | /? (0) ) along R(t), after which it comes back as 
a vector exp[i?j(l)]|/?(0)). The additional phase i] represents the holonomy 



— »■ 7t 1 (R) and corresponds to Berry’s phase. From (10.158), rj( 1) 
is given by 

r)(l) = i(b A = i I J (10.168) 

Jr Js 

where £F = AA is the field strength and S is the surface bounded by the loop R(t). 
It follows from (10.168) that Berry’s phase 77 (1) represents the ‘magnetic flux’ 
through the area S. 


Exercise 10.15. Use (10.165) to show that 


1 R 2 d/f 1 — R 1 dR 2 

2 R(R - R 3 ) 


Show also that 

R 2 df?i — Ri dR 2 

d (b — . 

(R + R 3 )(R - R 3 ) 

Observe that dtp is singular at R 2 — ±R. 


(10.169) 

(10.170) 


Problems 

10.1 Consider a two-dimensional plane M with coordinate R and a wavefunction 
1/7 which depends on R adiabatically as 1 // = \j/(r . R). Let R : [0, 1] — > M 
be a loop in M and suppose rfr (r, R(l)) = —i/sir, R( 0)), that is the phase of 1 jr 
changes by it after an adiabatic change along the loop. Show that there is a point 
within the loop at which the adiabatic assumption breaks down. See Longuet- 
Higgins (1975). 
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CHARACTERISTIC CLASSES 


Given a fibre F, a structure group G and a base space M, we may construct 
many fibre bundles over M, depending on the choice of the transition functions. 
Natural questions we may ask ourselves are how many bundles there are over M 
with given F and G, and how much they differ from a trivial bundle M x F . For 
example, we observed in section 10.5 that an SU(2) bundle over S 4 is classified 
by the homotopy group 7T3(SU(2)) = Z. The number n e Z tells us how the 
transition functions twist the local pieces of the bundle when glued together. 
We have also observed that this homotopy group is evaluated by integrating 
tr T 1 e // 4 (.S -4 ) over .S -4 , see theorem 10.7. 

Characteristic classes are subsets of the cohomology classes of the base 
space and measure the non-triviality or twisting of a bundle. In this sense, they 
are obstructions which prevent a bundle from being a trivial bundle. Most of the 
characteristic classes are given by the de Rham cohomology classes. Besides their 
importance in classifications of fibre bundles, characteristic classes play central 
roles in index theorems. 

Here we follow Alvalez-Gaume and Ginsparg (1984), Eguchi et al (1980), 
Gilkey (1995) and Wells (1980). See Bott and Tu (1982), Milnor and Stasheff 
(1974) for more mathematical expositions. 

11.1 Invariant polynomials and the Chern-Weil homomorphism 

We give here a brief summary of the de Rham cohomology group (see chapter 6 
for details). Let M be an ra -dimensional manifold. An /‘-form co e Q r (M) is 
closed if dai = 0 and exact if co — dz; for some // e Q 1 1 (M). The set of closed r- 
forms is denoted by Z' (M) and the set of exact /--forms by B‘ (M). Since d 2 = 0, 
it follows that Z' (M) D B r (M). We define the rth de Rham cohomology group 
H r {M) by 

H r (M) = Z r (M)/ B r (M). 

In H r (M), two closed /‘-forms a>\ and u > 2 are identified if co 1 — on = dz; for some 
r] e ~ 1 (A7). Let M be an m -dimensional manifold. The formal sum 

H*(M) = H°(M) © H\M) ® • • ■ © H m (M) 

is the cohomology ring with the product A : H*{M) x H*(M) — > H*(M) 
induced by A : H P (M ) x H q (M) -a H p+q (M). Let f : M —*■ N be a 


smooth map. The pullback /* : Q r (N) — > Q' (M) naturally induces a linear 
map f* : H r (N) — > H r (M) since /* commutes with the exterior derivative: 
/* d co = d /*&>. The pullback f* preserves the algebraic structure of the 
cohomology ring since /*(&> A rj) — f*a> A /*?/. 

11.1.1 Invariant polynomials 

Let M{k, C) be the set of complex k x k matrices. Let S r (M(k, Q) denote 
the vector space of symmetric / -linear C-valued functions on M (k. C). In other 
words, a map 

P :® M(k, C) -» C 

is an element of S' (M(k, Q) if it satisfies, in addition to linearity in each entry, 
the symmetry 

P(fli, . . . , a,, . . . , dj , . . . , a r ) 

— P{a\, . . . , aj, . . . , a,-, . . . , a r ) 1 <i,j<r (11.1) 
where a p e GL(k, Q. Let 


OO 

S*(M(k, Q) = © S r (M(k, O) 

r = 0 

denote the formal sum of symmetric multilinear C-valued functions. We define a 
product of P e SP(M(k , Q) and Q e S«(M(k, Q) by 

PQ(X 1 ,...,X p+q ) 

= (p + q )\ X! P( X PW’ • • ■ > x P(p))Q( x P(p+ 1). • • • . ^P(//+ 9 )) (H-2) 

where P is the permutation of (1, . . . , p + q). S*(M(k , Q) is an algebra with 
this multiplication. 

Let G be a matrix group and g its Lie algebra. In practice, we take 
G — GL(k, C),U(A:) or SU(A:). The Lie algebra g is a subspace of M(k , C) 
and we may consider the restrictions S r (g) and S*(g) = © ( . >0 S 1 ^(g). P e S r (g ) 
is said to be invariant if, for any geG and A,- e g, / satisfies 

P(Ad ? A\, , Ad ? A,) — P(Ai,...,A r ) (11.3) 

where Ad g Aj = g~ l Ajg. For example, 

P(A U A 2 , . . . , A,.) = str(A i , A 2 A,.) 

1 x - 

= — 2_, tr (^P(D’ Ap(2), • ■ • 7 Ap( r )) 
r ' p 


(11.4) 



is symmetric, r-linear and invariant, where ‘str’ stands for the symmetrized trace 
and is defined by the last equality. The set of G-invariant members of S' ( g ) is 
denoted by I r (G). Note that gi = 92 does not necessarily imply /' (G 1) = 
/' (G 2). The product defined by ( 1 1 . 2 ) naturally induces a multiplication 

I P {G) ® I q (G) -> I p+q {G). ( 11 . 5 ) 

The sum I*(G) = 0 , >o ^ (G) is an algebra with this product. 

Take P e P (G). The shorthand notation for the diagonal combination is 

P(A) = A e g. ( 11 . 6 ) 

r 

Clearly, P is a polynomial of degree r and called an invariant polynomial. P is 
also Ad G-invariant, 

P(Ad g A) — P(g~ { Ag) — P(A) Aeg.geG. ( 11 . 7 ) 

For example, tr(A') is an invariant polynomial obtained from ( 11 . 4 ). In general, 
an invariant polynomial may be written in terms of a sum of products of P r = 

tr(A'). 

Conversely, any invariant polynomial P defines an invariant and symmetric 
r-linear form P by expanding P(t\A\ + • • • + t r A r ) as a polynomial in r, . Then 
1/r! times the coefficient of t\ ti - ■ ■ t r is invariant and symmetric by construction 
and is called the polarization of P . Take P{A) = tr(A 3 ), for example. Following 
the previous prescription, we expand tx{t\A\ + f 3 ^ 4 3 ) 3 in powers of t\, t2 

and f3. The coefficient of t\tiG is 


tr(Ai A2A3 + Ai A3A2 + A2A1A3 + A2A3A1 + A3A1 A2 + A3A2A1) 

— 3 tr(Ai A2A3 + A2A1 A3) 

where the cyclicity of the trace has been used. The polarization is 

P(Ai, A2, A3) = j tr(Ai A2A3 + A2A1A3) = str(Aj, A2, A3). 

In the previous chapter, we introduced the local gauge potential A = A /( dr 11 
and the field strength T = jT /X i, dx ,i A dx v on a principal bundle. We have 
shown that these geometrical objects describe the associated vector bundles as 
well. Since the set of connections {Aj} describes the twisting of a fibre bundle, 
the non-triviality of a principal bundle is equally shared by its associated bundle. 
In fact, if ( 10 . 57 ) is employed as a definition of the local connection in a vector 
bundle, it can be defined even without reference to the principal bundle with which 
it is originally associated. Later, we encounter situations in which use of vector 
bundles is essential (the Whitney sum bundle, the splitting principle and so on). 



Let P{M, C) be a principal bundle. We extend the domain of invariant 
polynomials from g to g-valued p-forms on M. For A ; iy (A,- e g. q e 
£l Pi {M)\ 1 < i < r), we define 

P(Mm, • • • , A r i] r ) = r] i A ... A i] r P{A l , A,). (11.8) 

For example, corresponding to (1 1 .4), we have 

str(A 1 771 A r rj r ) = r)\ A ... A r) r str(Ai, . . . , A r ). 

The diagonal combination is 

P(Arf) = r] A . . . A r) P{A). (11.9) 


The action P or P on general elements is given by the r -linearity. In particular, 
we are interested in the invariant polynomial of the form PPS) in the following. 
The importance of invariant polynomials resides in the following fundamental 
theorem. 

Theorem 11.1. (Chern-Weil theorem) Let P be an invariant polynomial. Then 
P(T) satisfies 

(a) dP(T) = 0. 

(b) Let T and 9“' be curvature two-forms corresponding to different 
connections A and A! . Then the difference P ( T' ) — P(1 F) is exact. 

Proof, (a) It is sufficient to prove that d/A'J) = 0 for an invariant polynomial 
P r CJ) which is homogeneous of degree r. since any invariant polynomial can be 
decomposed into homogeneous polynomials. First consider the identity, 

Pr(gf l X lgt , . . . , gf'x r8r ) = P r (Xu..., X r ) 

where g, = exp(t X) and X , X, e g. By putting t — 0 after differentiation with 
respect to t, we obtain 

r 

Pr(X t, ...,[Xj,X],..., X r ) = 0. (11.10) 

1=1 

Next, let A be a g-valued p-form and Q, be a g-valued p,-form (1 < i < r). 
Without loss of generality, we may take A = Xjj and Qi = Xj rg where X , X t eg 
and i] On) is a p-form (p,-form). Define 


[^i, A] = in A r][Xi, X] 

= XiX{ m All)- (-1 yr'xXiin A m). 


(ii.il) 



Let us note that 


Pr (£2i , . . . , [Qj , A], . . . , £2 r ) 

= 171 A . . . A rji A rj A . . . A ly-PriX 1, . . . , XjX, X r ) 

— (—l) pp ‘ 

...Aij r P r (Xi,... I XX i ,...,X I .) 

= 1) A 1)1 A ... A 7J r (— l)^ 1_t ^'3 

x P r (X l ,...,[X i ,X],...,X r ). 

From this and (11.10), we find 

r 

Y_y-\y^ xJr - +Pi) P r isi \, . . . , [Qi, a], ... , ^,0 = 0. (11.12) 

i=i 

Next, consider the derivative, 

d P, (£2i, . 12, ■) = d(?n A ... A T] r )P r {X\, ...,X r ) 

r 

= J2(- 1 ) {Pl+ "' +P, ~ l) (m A ... A drjj A ... A T} r ) 
i= 1 

x P r {X l ,...,X i ,...,X r ) 

r 

= J](-l) (pi+ ' ' +p ‘- l) P r (£2u...,dn i ,...,£2 r ). (11.13) 

i=l 

Let A — A and 12; = 3 in (11.12) and (11.13) for which p — 1 and pt — 2. By 
adding 0 of the form ( 1 1 . 1 2) to ( 1 1 . 1 3) we have 

dPr (3, . . . , 30 

r 

= ^[P r (3, . . . , d3, . . . , SO + P r (2 , . . . , [A, 3], . . . , 3")] 

i= 1 
r 

= J]P r (3, ...,£>3,..., 3') = ° (11.14) 

i = 1 

since £>3 = d3 + [A, 3] = 0 (the Bianchi identity). We have proved 
dP, (3) = d P r (3, . . . , 3) = 0. 

(b) Let A and A' be two connections on E and let 3 and 3' be the respective 
field strengths. Define an interpolating gauge potential A t , by 

A,=A + te 0 = (A' - A) 0 < t < l (11.15) 



so that Aq=A and A{ — A! . The corresponding field strength is 


S, = dA, +A, A A, = S+tVO + t 2 0 2 (11.16) 


where VO — dd + [A, Q\ = d9 + A /\ 6 + 0 /\ A. We first note that 


PAS') - PAS) = PAS 1) - PA? o) = 


7 ' 


r l d 

= / d t-Pr&t) 
Jo dr 


= r I d tP r (-St,S t ,...,S t ). 


(11.17) 


From (1 1 . 16), we find that 


^-PA?t) = rPAVO + 2 tO 2 , S t , S t ) 

dr 

= rP r (V0, 3,,..., S t ) + 2 rtPAO 2 , S t , . . . , S t ). (11.18) 


Note also that 


VI, = dS, + [A, S,] = ~[A t , S,i + [A, 9J] = t[S„ 0] 


where use has been made of the Bianchi identity V t S, — dS, + [A,, 9>] = 0. [V 
is the covariant derivative with respect to A while V, is that with respect to A t .\ 
It then follows that 

d [PA0,% 3))] 

= Pride, S t ,..., S,) - (r - 1 )P r (6, dT, S,) 

= PriVO, S„ . . . , S,) - (r - 1 )PA0, VS ,, . . . , S,) 

= PAVO, S t , . . . , S,) - (r - l)f/V(0, [T r , 0], S„..., T f ) (11.19) 


where we have added a 0 of the form (11.12) to change d to V. If we take 
Qi = A = 6,^2 = • •• — dm = T r in (11.12), we have 

2 PAO 2 , S t ) + (r - 1 )PA9, [S t , 6], S, S t ) = 0. 

From (11.18), (11.19) and the previous identity, we obtain 

^PASt) = rd[PAe,S t ,...,S t )l 
dr 


We finally find that 


PAS') - PAS) = d r 



P r (A' -A, St,...,S t )dt 


(11.20) 


This shows that P r (S') differs from P,(S) by an exact form. 


□ 



We define the transgression T P r {A ! , A) of P, by 


TP r (A',A) 



dtP r (A' -A,T t ,...,T t ) 


( 11 . 21 ) 


where P r is the polarization of P r . Transgressions will play an important role 
when we discuss Chern-Simons forms in section 11.5. Let dimM = m. Since 
P m O differs from P m (S' ) by an exact form, their integrals over a manifold M 
without a boundary should be the same: 



PmO 0 = 



d TP m (A', A) = 



P m (A',A) = 0. (11.22) 


As has been proved, an invariant polynomial is closed and, in general, non- 
trivial. Accordingly, it defines a cohomology class of M. Theorem 11.1(b) 
ensures that this cohomology class is independent of the gauge potential chosen. 
The cohomology class thus defined is called the characteristic class. The 
characteristic class defined by an invariant polynomial P is denoted by xe(P ) 
where £ is a fibre bundle on which connections and curvatures are defined. 
[Remark: Since a principal bundle and its associated bundles share the same 
gauge potentials and field strengths, the Chern-Weil theorem applies equally to 
both bundles. Accordingly, E can be either a principal bundle or a vector bundle.] 


Theorem 1 1.2. Let P be an invariant polynomial in /* (G) and E be a fibre bundle 
over M with structure group G. 


(a) The map 


: I*(G) H*(M) 


(11.23) 


defined by P -* xe(P ) is a homomorphism (Weil homomorphism). 

(b) Let / : N — »■ M be a differentiable map. For the pullback bundle f*E of 
E, we have the so-called naturality 


Xf*E = f*XE- (11.24) 

Proof, (a) Take P, e /' (G) and P s e PAG). If we write S' = T a T a , we have 

(P r P s )( S') = S' 0 ” 1 A ... A S'“ r A S" ft /\ . . . /\ S* 

x ~ — TT Pr (Tax i • • • 7 Ta r ) Pn (Tp l , ... , Tp s ) 

(r + j): 

= PAT) A PAT). 

Then (a) follows since P r (T), P S (T) e H*(M). 

(b) Let A be a gauge potential of E and S' = d A + A A A. It is easy to verify 
that the pullback f*A is a connection in f*E. In fact, let A\ and Aj be local 
connections in overlapping charts G, and U j of M. If tjj is a transition function 



on Ui fl Uj , the transition function on f*E is given by f*ttj = tjj o f. The 
pullback f*A, and f*Aj are related as 

rAj = ru u 'A ili j+, i j\\, U ) 

= (/*0(/*A)(/%) + (r0(df*tij). 

This shows that f*A is, indeed, a local connection on f*E. The corresponding 
field strength on f*E is 

d(f*Ai) + f*Ai A f*Ai = f*[dAi + A, A A,] = /*S). 

Hence, /*/>(?,) = P(f*?i), that is /*Xe(P) = Xf*E(P)- □ 

Corollary 11.1. Characteristic classes of a trivial bundle are trivial. 

Proof. Let E —*■ M be a trivial bundle. Since E is trivial, there exists a map 
/ : M — > {/?} such that E — f* E q where Eo — >■ {/?} is a bundle over a 
point p. All the de Rham cohomology groups of a point are trivial and so are the 
characteristic classes. Theorem 1 1.2(b) ensures that the characteristic classes xe 
(— f*XE 0 ) of E are also trivial. □ 

11.2 Chern classes 
11.2.1 Definitions 

Let E — %• M be a complex vector bundle whose fibre is C* . The structure group 
G is a subgroup of GL(£, C), and the gauge potential A and the field strength S' 
take their values in g. Define the total Chern class by 

c(30 = det (i + \ ■ (H-25) 

Since S' is a two-form, els') is a direct sum of forms of even degrees, 

c(3T = 1 + ci (S') + c 2 dT + • • • (11.26) 

where cj( S') e Q 2j ( M ) is called the / th Chern class. In an m-dimensional 
manifold M, the Chern class Cj (S) with 2 j > in vanishes trivially. Irrespective 
of dimM, the series terminates at cif S') = det(iS'/27r) and cy(S') = 0 for j > k. 
Since Cj (S) is closed, it defines an element [cy (S')] of H 2 J(M). 


Example 11.1. Let F be a complex vector bundle with fibre C 2 over M, where 
G = SU(2) and dimM = 4. If we write the field S' = S' a '(cr a /2i), S'" = 
lu , dx ,x A d.r y , we have 



/ 1 + (i/27T)(ST 3 /2i) (i/27r)(J 1 - i3 r2 )/2i \ 

V (i/2^')(9" 1 + i(F 2 )/2i 1 - (i/27r)(T 3 /2i) / 

= i + -J- (t 3 /\ T 3 + 3* I /\J 1 + T 2 A T 2 ) . (1 1.27) 

Individual Chern classes are 


c 0 (S) = 1 

ci (90 = 0 


c 2 (?) = 



•J 01 Af 
4 



(11.28) 


Higher Chern classes vanish identically. 


For general fibre bundles, it is rather cumbersome to compute the Chern 
classes by expanding the determinant and it is desirable to find a formula which 
yields them more easily. This is done by diagonalizing the curvature form. 
The matrix form S' is diagonalized by an appropriate matrix g e GL(£, C) as 
g~ l (iS'/2jt)g — diag(.vi, . . . , Xk), where Xj is a two-form. This diagonal matrix 
will be denoted by A. For example, if G = SU(/c), the generators are chosen to be 
anti-Hermitian and a Hermitian matrix iff /2tx can be diagonalized by g € SU (k). 
We have 


det(7 + A) — det[diag(l + x\, 1 + X 2 , . . . , 1 + -fit)] 
k 

= na + ^> 

7 = 1 

= 1 + (xi H b Xk) + (x\X 2 b Xk-\Xk) 

H b (x\X2 H b Xk) 

= l + trA+ j{(tr A) 2 — tr A 2 } + • • • + det A. (11.29) 
Observe that each term of (1 1.29) is an elementary symmetric function of {xj}, 

So(xj) = I 


k 

Si(xj j) = J2xj 
7 = 1 

Si(xj) = y xjxj 

i<j 


(11.30) 


Sk(xj) = X\X2 . ..Xk. 



Since det (/ + A ) is an invariant polynomial, we have P(3 F) = /Tg.Tg 1 ) = 
P(2nA/i), see (11.7). Accordingly, we have, for general T, 


coOO = 1 


ci(T) = trA = trfg^g ') = 2^ tr3r 

c 2 ( S') = i[(trA) 2 - trA 2 ] = ^(i/27r) 2 [tr S' A tr T - tr(T A 30] (1L31 ) 


c k (T) = det A= (i/2jr) fc det^. 

Example 11.1 is easily verified from (11.31). [Note that the Pauli matrices (in 
general, any element of the Lie algebra su(n) of SU(«)) are traceless, tr o a = 0.] 

11.2.2 Properties of Chern classes 

We will deal with several vector bundles in the following. We often denote the 
Chern class of a vector bundle E by c(E). If the specification of the curvature is 
required, we write c(9\e). 

Theorem 1 1.3. Let E — M be a vector bundle with G — GL(k. C) and 
F = C*. 

(a) (Naturality) Let / : N -> M be a smooth map. Then 

c(f*E) = f*c(E). (11.32) 

(b) Let F —>■ M be another vector bundle with F — d and G — GL(/, C). 
The total Chern class of a Whitney sum bundle E © F is 

c(E ® F) = c(E) Ac(F). (11.33) 


Proof. 

(a) The naturality follows directly from theorem 1 1.2(a). Since the curvature 
of f*E is T f*E = f*$E, the total Chern class of f*E is 

c(/*£) = tlet (/ + A J />E ) = det (/ + A / *S' e ) 

= /* det (/ + ^^) = /*c(£). 

(b) Let us consider the Chern polynomial of a matrix 



[Note that the curvature of a Whitney sum bundle is block diagonal: $e®f — 
diag(3“£, 9V)-] We find that 


-) 

2n 


det I I 

\ 

= det I / 


) = det 


is 

2n 


i B 
2tt 


0 

/ 

det I I 


0 

' + § 

i C 

2i r 


= c(B)c(C). 


This relation remains true when B and C are replaced by If p and £F p, namely 
c(3e®f) — c(3e) a c(9» 


which proves (11.33). □ 

Exercise 11.1. (a) Let £ be a trivial bundle. Use corollary 11.1 to show that 

c(£) — 1. (11.34) 

(b) Let £ be a vector bundle such that £ = E\ © Ei where E\ is a vector 
bundle of dimension k\ and Ei is a trivial vector bundle of dimension kj. Show 
that 

c, (£) = 0 k\ + 1 < i < k\ + ki- (11.35) 

11.2.3 Splitting principle 

Let £ be a Whitney sum of n complex line bundles, 

£ = L\ © £2 © • • • © L n . (1 1.36) 

From (1 1.33), we have 

c(£) = c(£i)c(£ 2 )...c(£„) (11.37) 

where the product is the exterior product of differential forms. Since c, (L) — 0 
for r > 2, we write 

c(£«)= l + ci(£;)= l+Xi. (11.38) 

Then (11 .37) becomes 

n 

c(£) = ]“[ (1 + x ')- ( 1L39) 

!= 1 

Comparing this with (11.29), we find that the Chern class of an « -dimensional 
vector bundle £ is identical with that of the Whitney sum of n complex line 
bundles. Although £ is not a Whitney sum of complex line bundles in general, 
as far as the Chern classes are concerned, we may pretend that this is the case. 
This is called the splitting principle and we accept this fact without proof. The 
general proof is found in Shanahan (1978) and Hirzebruch (1966), for example. 



Intuitively speaking, if the curvature ,T is diagonalized, the complex vector 
space on which g acts splits into k independent pieces: Cr — > C © • • • © C. An 
eigenvalue x,- is a curvature in each complex line bundle. Since diagonalizable 
matrices are dense in M(«, Q, any matrix may be approximated by a diagonal 
one as closely as we wish. Hence, the splitting principle applies to any matrix. As 
an exercise, the reader may prove (1 1.33) using the splitting principle. 

11.2.4 Universal bundles and classifying spaces 

By now the reader must have some acquaintance with characteristic classes. 
Before we close this section, we examine these from a slightly different point of 
view emphasizing their role in the classification of fibre bundles. Let E — M 
be a vector bundle with fibre Cr . It is known that we can always find a bundle 

E M such that 

£©£ = MxC (11.40) 

for some n > k. The fibre F p of E at p e M is a A-plane lying in C" . Let G k , n (Q 
be the Grassmann manifold defined in example 8.4. The manifold G k ,n( Q is 
the set of A -planes in C" . Similarly to the canonical line bundle, we define the 
canonical A-plane bundle Lk, n (C) over Gk, n (O with the fibre Cr . Consider a 
map / : M — > Gk.n (C) which maps a point p to the A-plane F p in C" . 

Theorem 11.4. Let M be a manifold with dimM = m and let E M be a 
complex vector bundle with the fibre C k . Then there exists a natural number N 
such that for n > N, 

(a) there exists a map / : M — >• Gk.n (Q such that 

E = f*Lk,n(Q (11.41) 

(b) f*L k .n(C) = g*Lk.n(Q if and only if f,g : M G k . n ( Q are 
homotopic. 

The proof is found in Chern (1979). For example, if E — ^ M is a complex 

line bundle, then there exists a bundle E M such that £©£ = Mx(" and 
a map / : M — >■ G i, n (Q = CP n ~ l such that E — f*L , L being the canonical 
line bundle over CP" -1 . Moreover, if / ~ g, then f*L is equivalent to g*L. 
Theorem 1 1 .4 shows that the classification of vector bundles reduces to that of 
the homotopy classes of the maps M -> G k . n { Q- 

It is convenient to define the classifying space G*(C). Regarding a A-plane 
in C" as that in C" +1 , we have natural inclusions. 


Gk.k (C) Gjt,(t+i(C) ^ • • • c -»- G*(C) 


(11.42) 



where 


OO 

g*(Q = U g *.»(Q- (n.43) 

n=k 

Correspondingly, we have the universal bundle L & — > Gt(C) whose fibre is 
. For any complex vector bundle E — % M with fibre C k , there exists a map 
/ : M — * Gk(C) such that E — f*Lk( Q. 

Let E — M be a vector bundle. A characteristic class / is defined as a 
map x ; E — > /(£’) e H*(M) such that 

X(f*E) = f*x(E) (naturality) (11.44a) 

X(E) = x(E') if E is equivalent to E' . (11.44b) 

The map f* on the LHS of (11.44a) is a pullback of the bundle while f* on 
the RHS is that of the cohomology class. Since the homotopy class [/] of 
/ : M — » Gk ( .Q uniquely defines the pullback 

f* : H*(G k ) -* H*(M) (11.45) 

an element /(£) = f*x(Qk ) proves to be useful in classifying complex vector 
bundles over M with dim E — k. For each choice of x(Gk), there exists a 
characteristic class in E. 

The Chern class c ( E ) is also defined axiomatically by 

(i) c(f*E) — f*c(E) (naturality) (11.46a) 

(ii) c(E ) = cq(E) ® ci (£)©••• ® Ck(E) 

Ci (E ) e H 2i (M ); Ci (E) = 0 i > k (11.46b) 

(iii) c(E © F) — c(E)c(E) (Whitney sum) (11.46c) 

(iv) c(L) — l+x (normalization) (11.46d) 

L being the canonical line bundle over CP" . It can be shown that these axioms 
uniquely define the Chern class as (1 1.25). 

11.3 Chern characters 
11.3.1 Definitions 

Among the characteristic classes, the Chern characters are of special importance 
due to their appearance in the Atiyah-Singer index theorem. The total Chern 
character is defined by 


(VS 

ch(T) = trexp I — 




„(P XJ 


j = 1 


j\ \2 TX 


(11.47) 



The y'th Chern character ch 7 (T) is 


iT 

2it 


1 / i<T \J 

ch,(T) = — tr | 


(11.48) 


If 2 y > m — dim M, ch, (T) vanishes, hence ch(T) is a polynomial of finite order. 
Let us diagonalize T as 


iT 

2k 




g = A = diag(xi ,...,Xk) 


g e GL (k, C). 


The total Chern character is expressed as 


k 

tr[exp(/\)] = y^exp (xj). (11.49) 

7=1 

In terms of the elementary symmetric functions S r (xj), the total Chern character 
becomes 


iz ex p ( */) =iz( i+x j + krf + jrf + " ■ ) 

= k + S l (x j )+Z[ Sl (x j ) 2 -2S 2 (x j )] + ••• . (11.50) 

Accordingly, each Chern character is expressed in terms of the Chern classes as 


ch 0 (T) = £ (11.51a) 

chi (9“) = ci (T) (11.51b) 

ch 2 (30 = ^[ci(T) 2 - 2c 2 (T)] (11.51c) 


where k is the fibre dimension of the bundle. 

Example 11.2. Let P be a U(l) bundle over S 2 . If An and As are the local 
connections on I/n and Us defined in section 10.5, the field strength is given by 
T ( = dA; (i = N, S). We have 

ch(30 = 1 + ^— (11.52) 

2k 

where we have noted that T" = 0 (n > 2) on S 2 . This bundle describes the 
magnetic monopole. The magnetic charge 2 g given by (10.94) is an integer 
expressed in terms of the Chern character as 

N=Z~ f 3= f chi(T). (H-53) 

2 K J S 2 Js 2 



Let P be an SU(2) bundle over ,S 4 . The total Chern class of P is given by 
(1 1.27). The total Chern character is 




(11.54) 


Ch(fT) terminates at ch^OT ) since S" — 0 for n > 3. Moreover, tr? = 0 for 
G = SU(2), n > 2. As we found in section 10.5, the instanton number is given 
by 


L-SH 


= / ch 2 (?). 

s 4 


(11.55) 


In both cases, ch ; measures how the bundle is twisted when local pieces are 
patched together. 

Example 11.3. Let fbeaU(l) bundle over a 2m -dimensional manifold M. The 
mth Chern character is 


— tr . — , 

m ! V 27 x ) 


ltr^Y" = ±(-LY 

m\ \ 2?r / 

—f-T 

m\ \47r/ 

i V 


ij^dx^ A dx" 


S^ n . . . S„ mVm dx^ 1 A dx Vl A ... A dx^ A dx" 


= 1 — 1 ■ ■ ■ 3„ m v m dx 1 A . . . A dx 


2m 


4tt 7 


which describes the U(l) anomaly in 2m -dimensional space, see chapter 13. 
Example 11.4. Let L be a complex line bundle. It then follows that 


(MS 


\2: x 


MS 


ch(L) = trexp I — ) = e* = 1 + x x = — . 


2jt 


(11.56) 


For example, let L CP 1 be the canonical line bundle over CP 1 = .S' 2 . The 
Fubini-Study metric yields the curvature 

d z A dz 


S = —33 ln(l + |z| ) = 

(l + zzr 

see example 8.8. In real coordinates z = x + iy = r exp(i0), we have 


S = 2i- 


dx A dy 


= 2i- 


r dr A d 0 


(l+x 2 + y 2 ) 2 (1+r 2 ) 2 

From ch(T) = 1 + tr(i? /2ir), we have 

1 r dr Add 

ch| (T) = — 


7T (1 T r 2 ) 2 


Chi (L), the integral of chi (?) over S 2 is an integer. 


1 f rdrdO f c 


t~ 2 dr = -1. 


(11.57) 


(11.58) 


(11.59) 


(11.60) 


11.3.2 Properties of the Chern characters 

Theorem 11.5. (a) (Naturality) Let E — % M be a vector bundle with F — 

C k . Let / : N -> M be a smooth map. Then 

ch(/*£) = /*ch(£). (11.61) 

(b) Let E and F be vector bundles over a manifold M. The Chern characters 
of E <g> F and E © F are given by 

ch(E <g> F) = ch(E) A ch(F) (11.62a) 

ch(£ © F) = ch(£) ©ch(F). (11.62b) 


Proof, (a) follows from theorem 1 1.2(a). 

(b) These results are immediate from the definition of the ch-polynomial. 
Let 

fi A\ J 


1 /id 

ch ( A ) = E-i tr (*r 


j! \2ttJ 


be a polynomial of a matrix A. Suppose A is a tensor product of B and C, 
A — B ® C — B ® I + I ® C (note that Te®f — $ E <E> / + / <8> 3V)- Then we 
find that 


ch(B <g> C) 


^ 1 / i V 

y — — tr ( 5 ® ® cy 

j J ■ \2jr ) 
j ' v 7 m = 1 v 7 

1 1 / iC \" 

J]— tr — E~ tr — =ch(B)ch(C). 
z — ' m ! \2tt / z — ' n ! \2jt J 

m ' ' n ' ' 


Equation (1 1.62a) is proved if B is replaced by ‘Je and C by 3>- 
If A is block diagonal. 


A = 


B 0 
0 C 


B © C 


we have 

ch (B © C) = 



tr(fi © C) j 

[tr(fi ; ) + tr(C 7 )] = ch(fi) + ch(C). 


This relation remains true when A, B and C are replaced by T e®f, '2> i-: and T F 
respectively. □ 



Let us see how the splitting principle works in this case. Let L j (1 < j < k) 
be complex line bundles. From (1 1.62b) we have, for E — L\ © Li © • • • © 

ch(£) = ch(Li) © ch(L 2 ) © • • • © ch(L^). (11.63) 

Since ch(L,) = expOr,-), we find 

k 

ch(£) = |~ [ exp(Xj) (11.64) 

7=1 

which is simply (11.50). Hence, the Chern character of a general vector bundle E 
is given by that of a Whitney sum of k complex line bundles. The characteristic 
classes themselves cannot differentiate between two vector bundles of the same 
base space and the same fibre dimension. What is important is their integral over 
the base space. 

11.3.3 Todd classes 

Another useful characteristic class associated with a complex vector bundle is the 
Todd class defined by 

w) = n 1 XJ ~x- ( il65) 

j 

where the splitting principle is understood. If expanded in powers of xj, Td(ff ) 
becomes 

Td W =n(i + ^ + g(-i/-‘^f) 

= 1 + \ + T2 xi */ + \ x x J xk ^ — 

j j j<k 

= 1+ 1ci(T)+^[ C i(T) 2 + C2 (3')] + --- (11.66) 

where the are the Bernoulli numbers 

= 6 ^2 — 3(j ^3 — 42 — 30 ^5 — ^ 


The first few terms of ( 1 1 .66) are: 

Td 0 (T) = 1 (11.67a) 

Tdi(T)=ici (11.67b) 

Td 2 (T) = j^(cj + c 2 ) (11.67c) 

Td 3 (T)=^cic 2 (1 1.67d) 

Td 4 (T) = 72 o(-Ci + 4qc2 + 3 c\ + cic 3 - c 4 ) (11.67e) 

Td 5 (T) = ^q(-CjC 2 + 3cic| + CjC 3 - cic 4 ) (11.67f) 



where c,- stands for c, (S'). 


Exercise 11.2. Let E and F be complex vector bundles over M. Show that 

Td(E©£) = Td(E) ATd(E). (11.68) 


11.4 Pontrjagin and Euler classes 

In the present section we will be concerned with the characteristic classes 
associated with a real vector bundle. 


11.4.1 Pontrjagin classes 

Let £ be a real vector bundle over an m -dimensional manifold M with dimp E — 
k. If E is endowed with the fibre metric, we may introduce orthonormal frames 
at each fibre. The structure group may be reduced to O (k) from GL(A:, E). Since 
the generators of o (k) are skew symmetric, the field strength T of E is also skew 
symmetric. A skew-symmetric matrix A is not diagonalizable by an element of a 
subgroup of GL(k, M). It is, however, reducible to block diagonal form as 


/ 0 M 

-ki 0 


A 


0 \ 


0 Xj 

—Xj o 


V o 

/ ill 


— iA. i 


1 X 2 



— 1 X 2 


Vo J 


(11.69) 


where the second diagonalization is achieved only by an element of GL(k, Q. If 
k is odd, the last diagonal element is set to zero. For example, the generator of 
0 (3) = so(3) generating rotations around the z-axis is 


T z = 


0 1 0 \ 

-10 0 

0 0 0 / 


The total Pontrjagin class is defined by 


/ 2 (T) = det 


T 

2it 


(11.70) 



From the skew symmetry 3 1 = — T, it follows that 


det I / 


S' 

2tt 


= det I / 


S' 

27T 


= det I / — 


S 

27T 


Therefore, p(S) is an even function in S. The expansion of p(S) is 

p(S) = 1 + P1 (S) + p 2 (S) + • • • (11.71) 

where p j (S) is a polynomial of order 2 j and is an element of H 4 HM\ M). We 
note that /J/(S) = 0 for either 2 j > k — dim£ or 4 j > dimM. 1 
Let us diagonalize S /27r as 


S 

2n 


A = 


( -m 


V 


Lri 0 

— 1X2 

0 i.T2 


\ 


• J 


(11.72) 


where Xk = —Xk/2it, Xk being the eigenvalues of S. The sign has been chosen 
to simplify the Euler class defined here. The generating function of p(S) is given 

by 


S/2] 


P(S) = det(/ + A) = n (!+*?) 


(11.73) 


i = 1 


where 


W 2] =- 


k/2 if k is even 

I (k — l)/2 if k is odd. 


In (11.73) only even powers appear, reflecting the skew symmetry. Each 
Pontrjagin class is computed from (1 1.73) as 


S/2] 

Pi&)= E 


(11.74) 


*1 <l2< — <lj 

To write Pj{^) in terms of the curvature two-form T/27T, we first note that 


tr 


qr X 2 j S/2] . 

' =trA 2 ^ =2(-l)2 J2 x f J - 

i= 1 


2n / 


1 Although p m ( 30 = 0, p m (B ) need not vanish for a matrix B. p m will be used to define the Euler 
class later. 



It then follows that 


mm 

Pi(T) 


wff) 


P4&) 



i<j<k 


1 / l \ 6 

— I — I [-(trJ 2 ) 3 +6tr3 r2 tr3' 4 -8trT 6 ] 
48 \27r ) 

E 2 2 2 2 
X i X j X k*l 

i< j <k<l 


(11.75a) 


(11.75b) 


(11.75c) 


1 / 1 V 

— ) [(trT 2 ) 4 - 12(tr (F 2 ) 2 tr 3" 4 + 32 tr T 2 tr T 6 

384 \27r / 

+ 12(tr J 4 ) 2 -48tr3' 8 ] (11.75d) 


P[k/ 2 \&) = x\x\ . . .xl k/2] = (±) *t* (1 1.75e) 

The reader should verify that 

p(E ® F) = p{E) A p(F). (11.76) 

It is easy to guess that the Pontrjagin classes are written in terms of Chern 
classes. Since Chern classes are defined only for complex vector bundles, we must 
complexify the fibre of E so that complex numbers make sense. The resulting 
vector bundle is denoted by E . Let A be a skew-symmetric real matrix. We find 
that 


det(7 + \A) — det 


/ 1 + x\ 0 \ 

1 — x\ 

1 +X2 

0 1 — X2 


V 


■■ / 


[k/2] 


= Y\(l-x?)=l-pi(A) + p 2 (A) 


i=i 



from which it follows that 


Pj(E) = (-iy C2 j(E c ). (11.77) 

Example 11.5. Let M be a four-dimensional Riemannian manifold. When the 
orthonormal frame {e a } is employed, the structure group of the tangent bundle 
T M may be reduced to 0(4). Let 1Z — \lZ a p9 a A 9^ be the curvature two-form 
(1Z should not be confused with the scalar curvature). For the tangent bundle, it is 
common to write p(M) instead of p(7Z). We have 

det ( I + — ) = 1 ^rh-K 2 + — !— [(\xlZ 2 ) 2 -2\xlZ A ]. (11.78) 

V 2: rj 87 r 2 128nr 4 

Each Pontrjagin class is given by 

po(M) — 1 (11.79a) 

Pl (M) = -^tiTZ 2 = -^lZ a pTZp a (11.79b) 

P2(m) = Y^4 [(tr7e2)2 - 2tr7?4] = (^) det7? - {n.i9c) 

Although pi(M) vanishes as a differential form, we need it in the next subsection 
to compute the Euler class. 

11.4.2 Euler classes 

Let M be a 2/-dimensional orientable Riemannian manifold and let TM be the 
tangent bundle of M. We denote the curvature by 1Z. It is always possible to 
reduce the structure group of T M down to SO(2 1) by employing an orthonormal 
frame. The Euler class e of M is defined by the square root of the 4/-form pi, 

e(A)e(A)= pi(A). (11.80) 

Both sides should be understood as functions of a 2 / x 21 matrix A and not of 

the curvature 1Z, since pi(lZ) vanishes identically. However, e(M ) = e(TZ) thus 
defined is a 2/-form and, indeed, gives a volume element of M. If M is an odd- 
dimensional manifold we define e(M ) = 0, see later. 

Example 11.6. Let M = S 2 and consider the tangent bundle T S 2 . From example 
7.14, we find the curvature two-form, 

9 d0 A d(p 

TZe<h — —7 Zd>e = shr 6 = sin0 d 6 A d </> 

sin0 

where we have noted that ggg — sin 2 9. Although pi(S 2 ) = 0 as a differential 
form, we compute it to find the Euler form. We have 

Pi(S 2 ) — — ^jtr 1Z 2 — — ^9 [TZg^lZ^e + JZ^gTZgi/,] 

— — sin0d0Ad</)^ 



from which we read off 


9 1 

e(S 2 ) = — sin<9d(9 Adf (11.81) 

2tt 

It is interesting to note that 

[ e(S 2 ) = - -f d </> f d6 sin 0 = 2 (11.82) 

Js 2 27T J o Jo 

which is the Euler characteristic of S 2 , see section 2.4. This is not just a 
coincidence. Let us take another convincing example, a torus T 2 . Since T 2 admits 
a flat connection, the curvature vanishes identically. It then follows that e(T 2 ) = 0 
and x(T 2 ) — 0. These are special cases of the Gauss-Bonnet theorem. 


/■ 


e(M) — x (M) 


(11.83) 


for a compact orientable manifold M. If M is odd dimensional both e and / 
vanish, see (6.39). 


In general, the determinant of a 21 x 21 skew-symmetric matrix A is a square 
of a polynomial called the Pfaffian Pf( A), 2 

det A = Pf(A) 2 . (11.84) 


We show that the Pfaffian is given by 
(- 1 )' 


PfTA) = 


2 l l\ 


y, sgn(P)Ap(i)P( 2 )Ap ( 3 )P( 4 ) . . . Ap( 2 /_i)P( 2 Z) (1 1.85) 


where the phase has been chosen for later convenience. We first note that a skew- 
symmetric matrix A can be block diagonalized by an element of 0(2 1) as 


/ 0 Aj 

— Ai 0 


S' AS — A — 


\ 


0 X 2 
— X 2 0 


0 X, 
-Xi 0 / 


( 11 . 86 ) 


It is easy to see that 


l 

det A = det A = [A 2 . 

!= 1 


2 See proposition 1.3. The definition here differs in phase from that in section 1.5. It turns out to be 
convenient to choose the present phase convention in the definition of the Euler class. 



To compute Pf(A), we note that the non-vanishing terms in ( 11 . 85 ) are of the 
form A12A34 . . . A 21- 1,2/- Moreover, there are 2 1 ways of changing the suffices as 
Ajj —*■ Aji, such as 

A12A34 . . . A 2/— 1,2/ -> A21 A34 . . . A 2/— 1,2/ 
and /! permutations of the pairs of indices, for example, 

A12A34 . . . A 2/— 1,2/ -* A34A12 . . • A2/— 1,2/- 


Hence, we have 


l 

Pf(A) = (— 1/A12A34 • ■ • A 2 / -i,2Z = (- 1 )' n 

i = 1 


Thus, we conclude that a block diagonal matrix A satisfies 

detA = Pf(A) 2 . 

To show that ( 11 . 84 ) is true for any skew-symmetric matrices (not necessarily 
block diagonal) we use the following lemma, 3 

Pf(Z'AZ) = Pf(A) det X. ( 11 . 87 ) 

If S' AS = A for Se 0 ( 2 /), we have A = SAS\ hence 

1 

Pf(SAS l ) = Pf( A) det S = (- l/ ]“[ A.,- det 5 . 

i= 1 

We finally find det A = Pf(A) 2 for a skew-symmetric matrix A. 

Note that Pf(A) is S 0 ( 2 /) invariant but changes sign under an improper 
rotation S (det S = — 1 ) of 0 ( 2 /). 

Exercise 11 . 3 . Show that the determinant of an odd-dimensional skew-symmetric 
matrix vanishes. This is why we put e(M) = 0 for an odd-dimensional manifold. 

The Euler class is defined in terms of the curvature 1Z as 

e(M) = Pf( 7 ?./ 27 r) 

= sgn(/*)7^f (i)p(2) • • ■ 'TZp(V-V)P(2i)- (11.88) 

3 Since det(X l AX) = (det X ) 2 det A, we have Pf(X l AX) = ±Pf(A) det X. Here the plus sign should 
be chosen since Pf (I l AI) = Pf(A). 



The generating function is obtained by taking xj = —A /In, 

l 

e{x)=x\X 2 ...xi = Y\*i- (11.89) 

i=l 

The phase (— 1) ; has been chosen to simplify the RHS. 


Example 11.7. Let M be a four-dimensional orientable manifold. The structure 
group of T M is SO(4), see example 11.5. The Euler class is obtained from (1 1 .88) 
as 


e(M) = 


2(4jt ) 


•( ' /A/ 7\// a 7\ A /. 


(11.90) 


This is in agreement with the result of example 1 1 .5. The relevant Pontrjagin class 
is 

P2{M) = T ^ l(trn2)2 ~ 2tr ^ 4] = x ' x2 ' 

Since e(M) — X 1 X 2 , we have p 2 (M) — e(M ) A e(M). This is written as a matrix 
identity, 


j-[(tr A 2 ) 2 — 2tr A 4 ] = 


2(4jr) 4 


: ijk, A U A kl 



11.4.3 Hirzebruch L-polynomial and A-genus 

The Hirzebruch L-polynomial is defined by 

k 


L(x) — 


Ff — 

1 1 tanhx 
7 = 1 



(11.91) 


where the B„ are Bernoulli numbers, see (1 1.66). The function L(x) is even in xj 
and can be written in terms of the Pontrjagin classes, 

L(T) = 1 + \p x + ±{-p\ + 7/72) + ^(2 P \ - 13/7^2 + 62/73) + • • • (1 1.92) 

where pj stands for /?/(T). From the splitting principle, we find that 


L(E © F) = L(E) A L(F). 
The A (A-roof) genus A (S') is defined by 


(11.93) 


Am = FT - 

A X Cl 


Xj/ 2 


sinh (xj/2) 


7=1 


= n 


n> 1 


(11.94) 



This is an even function of x j and can be expanded in pj. A is also called the 
Dirac genus by physicists. It satisfies 

A{E ®F) = A(E) A A(F). (11.95) 

A is written in terms of the Pontrjagin classes as 

Am = i-±p l + ^ m (ip 2 l -4 P2 ) 

+ 96T680^ _31 Pi + 44 P1P2 - i6p 3 ) H . (11.96) 

Example 11.8. Let M be a compact connected and orientable four-dimensional 
manifold. Let us consider the symmetric bilinear form a : H 2 (M; M) x 
H 2 (M\ 1) -> M defined by 

f a Afi. (11.97) 

J M 

a is a b 2 x b 2 symmetric matrix where b 2 — dim H 2 (M\ M) is the Betti number. 
Clearly a is non-degenerate since cr ([or], [/!]) = 0 for any [a] e H 2 (M\ IK) 
implies [/l] = 0. Let p (q) be the number of positive (negative) eigenvalues of cr. 
The Hirzebruch signature of M is 

r (M) = p-q. (11.98) 

According to the Hirzebruch signature theorem (see section 12.5), this number 
is also given in terms of the L-polynomial as 

r (M)= f L\(M) = \ f pi(M). (11.99) 

Jm Jm 


11.5 Chern-Simons forms 
11.5.1 Definition 

Let Pj(Z T) be an arbitrary 2j -form characteristic class. Since Pj (IT ) is closed, it 
can be written locally as an exact form by Poincare’s lemma. Let us write 

P j m = dQ 2j -i(A,3') (11.100) 

where Qij-i (-A, T) e g <8> f2 2/_1 (M). [Warning: This cannot be true globally. If 
Pj — dQ 2 j-i globally on a manifold M without boundary, we would have 

I Pm/2 — I dQ m — \ = I Qm — 1 = 0 
jm Jm Jsm 

where m = dimM.] The 2j — 1 from 02 / -I (A, T ) is called the Chern-Simons 
form of Pj (T). From the proof of theorem 1 1.2(b), we find that Q is given by the 
transgression of Pj, 

Q 2j -i(A, T) = TP, (A. 0 ) = jf Pj(A , T,, . . . , T r ) d t 

Jo 


( 11 . 101 ) 



where Pj is the polarization of Pj,J — dA + Ac and we set A' = IT' = 0. Since 
Q 2 j-\ depends on IF and A, we explicitly quote the A-dependence. Of course, 
A! can be put equal to zero only on a local chart over which the bundle is trivial. 

Suppose M is an even-dimensional manifold (dimM = m = 21) such that 
3 M ^ 0. Then it follows from Stokes’ theorem that 

[ P,&)= ( dQ m -i{A,3)= f Q m -i{A,3). (11.102) 

JM JM JdM 

The LHS takes its value in integers, and so does the RHS. Thus Q m - 1 is a 
characteristic class in its own right and it describes the topology of the boundary 
3 M. 


11.5.2 The Chern-Simons form of the Chern character 

As an example, let us work out the Chern-Simons form of a Chern character 
chy (IF). The connection A, which interpolates between 0 and A is 

A, = tA (11.103) 

the corresponding curvature being 

= t dA + t 2 A 2 = + (t 2 - t)A 2 . (1 1.104) 


We find from (11.21) that 


1 ( i V 




dr str(A. T, ). 


For example, 


l, 30 = — [ dr 
27 r Jo 

/ i \ 2 f 

\2tt J Jo 


dr tr A — — tr A 
2n 


dr str(A, tdA + t 2 A 2 ) 


2\2i r 


tr f AdA + J^A 2 \ . 


Qs(A,T> = 


i /j_V r 

2\2it) Jo 

1 / i V r 

= - — tr , 

6 \2n J 


dr str[A, (tdA + t 2 A 2 ) 2 ] 


3 3 

A(dA)~ + — AdA T — A~ 
2 5 


(11.105) 


(11.106a) 


(11.106b) 


(11.106c) 


Exercise 11.4. Let IF be the field strength of the SU(2) gauge theory. Write down 
the component expression of the identity c!i 2 (T ) = dQj,(A, 3 ) to verify that (cf 
lemma 10.3) 

tr = dA2e K ^ v tr (A x d„A v + f^^v)]. 


(11.107) 



11.5.3 Cartan’s homotopy operator and applications 


For later purposes, we define Cartan’s homotopy formula following Zumino 
(1985) and Alvarez-Gaume and Ginsparg (1985). Let 

A, =A 0 + t(Ai -A 0 ) 3) = dA, +A 2 (11.108) 

as before. Define an operator l, by 

l, A t = 0 1,1, = St(Ai -Aq). (11.109) 

We require that l, be an anti-derivative, 

ItiVpCOq) = (, l t r)p)co q + (-1 ) p rj p (l t a> q ) ( 11 . 110 ) 

for r) p e Q P (M) and w q e £L q (M). We verify that 

(d I, + l,d)A, = 1,(1, - Ah = 8t(Ai - A 0 ) = 8t^- 

ot 

and 

(d/ ; + l,d)1, = d[5f(Ai - A 0 )] + l t [V,1, - A, 1 t + 1,A,] 

— 5f[d(Ai — A 0 ) + A, (A I — A 0 ) + (A\ — Ao)At\ 

33” 

= 8tV,(A\ — Aq) = 8t — — - 

at 

where we have used the Bianchi identity V,1, — 0. This shows that for any 
polynomial S(A, 3“) of A and 1, we obtain 


(d 1, + /,d)S(A„ 1,) = 8t^-S(A,, 1,). (11.111) 

at 

On the RHS, S should be a polynomial of A and 3“ only and not of dA or 
dS^: if S does contain them, dA should be replaced by 1 — A 2 and d3“ by 
VI — [A, 9"| = —[A, 3]. Integrating (11.111) over [0, 1], we obtain Cartan’s 
homotopy formula 

S(Ai, h) - S(Ao, To) = (d/co, + %d)S(A f , 1,) (11.112) 

where the homotopy operator k{\\ is defined by 

koiS(A,,1,)= [ 8t l,S(A,, 1,). (11.113) 

Jo 

To operate A'oi on ,S'(A, 3“), we first replace A and 1 by A, and 1,, respectively, 
then operate l, on S’ (A,, 3)) and integrate over t. 



Example 11.9. Let us compute the Chern-Simons form of the Chern character 
using the homotopy formula. Let ,S(A, S) = ch/+i(S) and A \ — A, Ao = 0. 
Since dchy+ifS) = 0, we have 

ch j+1 (J) = (d% + * 0 id)ch/ +1 (S t ) = d[%ch ;+ i(S,)]. 


Thus, %ch,+i(S) is identified with the Chern-Simons form Qij+AA, S). We 
find that 


*oich 7 - + i(S f ) 


1 / iS \ J+] 

kf)i tr — 

O' + D! V2tt ) 

7 + 1 /*! 


1 / i V +A 

= 7—^ hr / 8tl ' tr( + + ) 

(j + 1)! \27r / Jo 

1 / i V +1 f 1 

-I / St str(A, Sf) 

Jo 


J! V 2 ^7 


(11.114) 


in agreement with (11.105). 


Although a characteristic class is gauge invariant, the Chern-Simons form 
need not be so. As an application of Cartan’s homotopy formula, we compute the 
change in Qij+\ (A. S) under A -> A g = g _1 (A + d)g, S' — > S g — g _1 Sg. 
Consider the interpolating families Af and Sf defined by 

Af = tg~ l Ag + g~ l dg (11.115a) 

= dA* + (Af) 2 = g~ l ? t g (11.115b) 

where S r = rS + (r 2 — f)A 2 . Note that Af = g _l dg, Af = A ? , Sf = 0 and 
Sf = S g . Equation (11.112) yields 

Q 2 j+i(A g ,V g )- Q2j+i(g~ l dg,0) = (dkoi + koid)Q 2 j+i(Af,Vf). (11.116) 

For example, let Q 2 j+i be the Chern-Simons form of the Chern character 
ch /+ i(S). Since dQ 2 j+i(Af , Sf) = ch/+i(Sf) = ch,+i(S r ), we have 

*0t dQ 2j+ i(.A*, Sf) = A-oich /+ i (Sf) 

= %chj +1 (S r ) = G 2 j+i(-A. ?) (11.117) 

where the result of example 11.9 has been used to obtain the final equality. 
Collecting these results, we write (1 1.1 16) as 

G 2 ; +1 (A*,S*)- G 2 j+i(A,S)= e 2 ;+i(g“ 1 dg,0) + d« 2; - (11.118) 

where a 2 j is a 2 J-form defined by 

oi 2 j(A, S, v) = koi Q 2 j+i(Af, Sf) 

= k 0l Q 2 j+i(At + v.9 r ) (11.119) 



where v = dg • g K [Note that Q 2 j+i(A, 30 = Qij+iigAg l ,g$g [ ).] The 
first term on the RHS of ( 1 1 . 1 1 8) is 

Qij+i(g~ l dg. 0) = St tr[g _1 dg{(r - r)(g _1 dg) 2 }- / ] 

-I(iy tr [(g-'dg) 2 * 1 ] J St(t 2 -t)j 

where we have noted that T, = ( t 2 — t)(g~ 1 dg) 2 and 

r 1 / ; i -i 2 

/ st(t 2 - t y = (-i yga + i,7 + i) = c-iy .. r~ 

Jo (2j + 1)! 

Z? being the beta function. The 2/ + I form Q 2 j+i(gdg, 0) is closed and, hence, 
locally exact: dQ 2 j+\(g~ l dg, 0) = ch /+ i(0) = 0. 

As for « 2 / we have, for example, 

«2 = \ J It tr[(A, + v)% -\{A t + u) 3 ] 

= K^r) L -v*! 

= - H^) tr(WlA) (11.121) 

where we have noted that 

tr A 2 = dx^ A d.r 1 ’ trC/l^/l,,) = — dx v A dx 11 tr (AyA^) — 0. 

Example 1 1.10. In three-dimensional spacetime, a gauge theory may have a 
gauge-invariant mass term given by the Chern-Simons three-form (Jackiw and 
Templeton 1981, Deser et a! 1982a, b). Since the Chern-Simons form changes 
by a locally exact form under a gauge transformation, the action remains invariant. 
We restrict ourselves to the U(l) gauge theory for simplicity. Consider the 
Lagrangian (we put A — iA, T = if) 

C = + \me^ v F Xll A v (11.122) 

where F^y — d^A v — 3 yA^. Note that the second term is the Chern-Simons form 
of the second Chern character F 2 (modulo a constant factor) of the U(1 ) bundle. 
The field equation is 


d^F^ + m * F" = 0 


(11.123) 



where 


*F li = \ ^ kX F kX F ^ = e ^ * F x . 

The Bianchi identity 

3 /x * F 11 = 0 (11.124) 

follows from ( 1 1.123) as a consequence of the skew symmetry of F 1 ' 1 ’ . It is easy 
to verify that the field equation is invariant under a gauge transformation, 

Ap -> A M + 3 /t (9 (11.125) 

while the Lagrangian changes by a total derivative, 

£ -> - iF^F^ + \me^ v F x ^A v + d v 6) = £ + ±m3 v (*F v 0). (11.126) 

Equation (11.106b) shows that the last term on the RHS is identified with 

Q 3 (A\ F e ) - Q 3 {A , F) ~ (A + d<9) dA - A dA ~ d(0dA). 

If we assume that F falls off at large spacetime distances, this term does not 
contribute to the action: 

J d 3 x£— > J d 3 xL+ 1 ^ J d 3 xd v (*F v 0) = J d 3 x£. (11.127) 

Let us show that (11.122) describes a massive field. We first write (11.123) 
as 

6^% * Fa = -m * F v . 

Multiplying s kXv on both sides, we have 

d X * F k —3 K * F x = —m F kX . 

Taking the (('' -derivative and using (1 1.124), we find that 

(d x d x + m 2 )* F k = 0 (11.128) 

which shows that *F K is a massive vector field of mass in. 


11.6 Stiefel-Whitney classes 

The last example of the characteristic classes is the Stiefel-Whitney class. In 
contrast to the rest of the characteristic classes, the Stiefel-Whitney class cannot 
be expressed in terms of the curvature of the bundle. The Stiefel-Whitney class 
is important in physics since it tells us whether a manifold admits a spin or not. 
Let us start with a brief review of a spin bundle. 



11.6.1 Spin bundles 

Let T M — M be a tangent bundle with dim M — m. The bundle T M is 
assumed to have a fibre metric and the structure group G is taken to be O (m). If, 
furthermore, M is orientable, G can be reduced down to SO (m). Let LM be the 
frame bundle associated with T M. Let be the transition function of LM which 
satisfies the consistency condition (9.6) 

Ujtjktki — I Li — /• 

A spin structure on M is defined by the transition function tij e SPIN(w) such 
that 

<p(Lj ) = Lj Lj t jkLki = I Li — I (11.129) 

where (p is the double covering SPIN(;«) — > SO (in). The set of tij defines a spin 
bundle PS(M) over M and M is said to admit a spin structure (of course, M 
may admit many spin structures depending on the choice of tij). 

It is interesting to note that not all manifolds admit spin structures. Non- 
admittance of spin structures is measured by the second Stiefel-Whitney class 
which takes values in the Cech cohomology group / / 2 ( M : TLi). 

11.6.2 Cech cohomology groups 

Let Z 2 be the multiplicative group {—1, +1}. A Cech r-cochain is a function 
/O'o, / 1 , . . . , i r ) € Z 2 , defined on £/,•„ P\ Uj l fl . . . fl Up ^ 0, which is totally 
symmetric under an arbitrary permutation P , 

/ 0>( 0) ip(r)) — f (*0, • • • > ir)- 

Let C' (M, Z 2 ) be the multiplicative group of Cech r-cochains. We define the 
coboundary operator 8 : C r (M; Z 2 ) — >■ C r+l {M\ Z 2 ) by 

r+l 

(V)0'o, . . ir+i) = n • ■ -,ij, ■ ■ ■ . ir+l) (11.130) 

7=0 

where the variable below the ' is omitted. For example, 

(SfoKh, h) = Mh)fodo) fo e C°(M; Z 2 ) 

(Sfi)(io,ii,h) — fi(ii,i2)fi(io,h)fi(io,ii) fl e C*(M; Z 2 ). 

Since we employ the multiplicative notation, the unit element of C r (M; Z 2 ) is 
denoted by 1. We verify that S is nilpotent: 

r+l 

(S 2 f)do, ■ • - , i r +i) = Y[ d, ir+l) = 1 

j,k= 1 



since —1 always appears an even number of times in the middle 
expression (for example if f(io, i k , . . . , + 2 ) = — 1, we have 

f(io, i r +2 ) — — 1 from the symmetry of /). Thus, we have 

proved, for any Cech /■ -cochain /, that 

8 2 f= 1. (11.131) 

The cocycle group Z' (M; Z 2 ) and the coboundary group B r (M: Z 2 ) are 
defined by 

Z' (M\ Z 2 ) = {/ e C' (M\ Z 2 )| 5/ = 1} (11.132) 

B r (M; Z 2 ) = {/ e C r (M; Z 2 )|/ = 5/', /' e C' _1 (M; Z 2 ). (11.133) 

Now the rth Cech cohomology group // ' ( A7 ; Z 2 ) is defined by 

H r (M; Z 2 ) - kerS,./im5,._i = Z' (M; Z 2 )/B' (M; Z 2 ). (11.134) 

11.6.3 Stiefel- Whitney classes 

The Stiefel- Whitney class w r is a characteristic class which takes its values in 
H r (M; Z 2 ). Let T M M be a tangent bundle with a Riemannian metric. The 

structure group is ()(/«), m = dim M. We assume { (7, } is a simple open covering 
of M, which means that the intersection of any number of charts is either empty 
or contractible. Let {e/c, }d < a < m) be a local orthonormal frame ofTM over 
Uj. We have = Bj^ja where f,y : £/,• fl U j — > O(m) is the transition function. 
Define the Cech 1 -cochain f(i, j ) by 


/(*. j) = det(fy) = ±1. (11.135) 

This is, indeed, an element of C l (M ; Z 2 ) since /(i, /) = f(j, i). From the 
cocycle condition tjjtjktki = /, we verify that 

8f(i, j, k) = deUtjj ) det(tjk) det (t ki ) 

— det (tjjtj k tki) — 1. (11.136) 

Hence, / e Z 1 (M, Z2) and it defines an element [/] of H l (M\ Z2). Now we 
show that this element is independent of the local frame chosen. Let {ej a } be 
another frame over Uj such that = hjej a , hi € O (m). From e 1Q . = Bjejct, we 
find t,j = hjtijhj 1 . If we define the 0-cochain /o by fo(i) = det/i/, we find that 

f(i, j ) = det (hitijhJ 1 ) — det (hi) det (h j) det (f,y ) 

= Vo(i, ./>./ •'. j) 

where use has been made of the identity det/?” 1 = det hj for h j e O(m). Thus, 
/ changes by an exact amount and still defines the same cohomology class f/]. 4 


4 


Note that the multiplicative notation is being used. 



This special element w\{M) = [/] e // 1 (M: Zf) is called the first Stiefel- 
Whitney class. 

Theorem 1 1.6. Let TM M be a tangent bundle with fibre metric. M is 
orientable if and only if w\ (M) is trivial. 

Proof. If M is orientable, the structure group may be reduced to SO (m) and 
f{i,j) — det (tij) — 1, and hence w\(M) = 1, the unit element of Z 2 . 
Conversely, if w\(M) is trivial, / is a coboundary; / = Sfy. Since fo(i) = ±1, 
we can always choose hi e O (m) such that det(/r,) = /o(0 for each i. If 
we define the new frame e 1Q . = we have transition functions tjj such 

that det (tjj) = 1 for any overlapping pair (i, j ) and M is orientable, [Suppose 
f(i, j) = det tjj — — 1 for some pair (i, j). Then we may take fo(i) — —1 and 
fo(j ) = +1, hence det by = — det t,y = +1.] □ 

Theorem 1 1 .6 shows that the first Stiefel- Whitney class is an obstruction to 
the orientability. Next we define the second Stiefel- Whitney class. Suppose M 
is an m-dimensional orientable manifold and TM is its tangent bundle. For the 
transition function tij e SO (m), we consider a ‘lifting’ tij e SPIN(m) such that 

<P(tij) = tij tji = tT l (11.137) 

where (p : SPIN(/w) — > SO (///) is the 2 ; 1 homomorphism (note that we have an 
option tij ** tjj or —tij). This lifting always exists locally. Since 

Vihjtjktld) — 6/ tjkjki = 1 

we have tijtjktki € ker (p = {±/}. For tjj to define a spin bundle over M, they 
must satisfy the cocycle condition, 

tijtjktki — 1- (11.138) 

Define the Cech 2-cochain /;{/,• f 1 U j fl Uk — »• Z 2 by 

tijtjktki = /O', j, k)I. (11.139) 

It is easy to see that / is symmetric and closed. Thus, / defines an element 
e 1 1 1 ( M . Z 2 ) called the second Stiefel-Whitney class. It can be shown 
that u> 2 (M) is independent of the local frame chosen. 

Exercise 11.5. Suppose we take another lift — tij of tjj. Show that / changes by 
an exact amount under this change. Accordingly, [/] is independent of the lift. 
[Hint: Show that f(i, j, k ) -> f(i, j.k)8f\(i, j , k) where f\ (/, j) denotes the 
sign of ±ijj .] 

Theorem 1 1.7. Let TM be the tangent bundle over an orientable manifold M. 
There exists a spin bundle over M if and only if i />2 ( M ) is trivial. 



Proof. Suppose there exists a spin bundle over M. Then we define a set of 
transition functions tjj such that tijtjktki — I over any overlapping charts Ui, U j 
and Uk, hence u> 2 (M) is trivial. Conversely, suppose w 2 (M) is trivial, namely 

fd, j ■ k) - SMi, j, k) = /i0\ k)f\ (i, k)f x (k , i) 

f\ being a 1 -cochain. We consider the 1 -cochain f\ (i, j ) defined in exercise 1 1.5. 
If we choose new transition functions P. = tij f\ (i, j), we have 

= [Sf] (i, j, k)] 2 = I 

and, hence, {?'. } defines a spin bundle over M. □ 

V 

We outline some useful results: 


(a) 

w l (CP m )= 1 u»(CP m )= 1 ' HOdd (11.140) 

I x m even 

x being the generator of // 2 (CP'" ; TLf). 

(b) 

wi(S m ) = w 2 (S m ) = 1 (11.141) 

(c) 

tui(E^) = w 2 (T, g ) = 1 (11.142) 

Eg being the Riemann surface of genus g. 
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INDEX THEOREMS 


In physics, we often consider a differential operator defined on a manifold M. 
Typical examples will be the Laplacian, the d’ Alembertian and the Dirac operator. 
From the mathematical point of view, these operators are regarded as maps of 
sections 

D : T(M, E) -* T(M, F) 

where E and F are vector bundles over M. For example, the Dirac operator is a 
map F ( M . E) —> F(M 1 E), E being a spin bundle over M. If inner products are 
defined on E and F, it is possible to define the adjoint of D, 

D r : F(M, F) -> r (M, E). 

Since it is a differential operator, D carries analytic information on the 
spectrum and its degeneracy. In what follows, we are interested in the zero 
eigenvectors of D and D * , 

kerD e{s£ T(M, E)\Ds = 0} 
kerZT E{se T(M, F)|D t s = 0}. 

The analytical index is defined by 

ind D — dim ker D — dim ker D 1 . 

Surprisingly, this analytic quantity is a topological invariant expressed in terms of 
an integral of an appropriate characteristic class over M, which provides purely 
topological information on M. This interplay between analysis and topology is 
the main ingredient of the index theorem. 

Our exposition follows Eguchi etal (1980), Gilkey (1984), Shanahan (1978), 
Kulkarni (1975) and Booss and Bleecker (1985). The reader should consult these 
references for details. Alvarez (1985) contains a brief summary of this subject 
along with applications to anomalies and strings. 

12.1 Elliptic operators and Fredholm operators 

In the following, we will be concerned with differential operators defined on 
vector bundles over a compact manifold M without a boundary. We exclusively 
deal with a nice class of differential operators called the Fredholm operators. 



12.1.1 Elliptic operators 


Let E and F be complex vector bundles over a manifold M. A differential 
operator D is a linear map 


D : T(M, E) — ► T(M, F). (12.1) 

Take a chart U of M over which E and F are trivial. We denote the local 
coordinates of U as x ,J . We introduce the following multi-index notation, 

M = (fii, fi 2 , . . ■ , fi,„) Hj e Z, ix j > 0 
\M\ = Hi + Fl + ' ' ' + Fm 

M dx M 3(x') w . . . 3(x m )t tm 
If dim E — k and dim F — k', the most general form of D is 

[£>s(jc)]“ = J2 A Ma a (x)D M s a (x ) 1 <a<k’ (12.2) 

\M\<N 
1 <a<k 

where s(x) is a section of E. Note that x denotes a point whose coordinates are 
x^. This slight abuse simplifies the notation. A M = ( A M ) a a is a lx k’ matrix 
which may depend on the position x. The positive integer N in (12.2) is called 
the order of D. We are interested in the case in which /V = I (the Dirac operator) 
and N — 2 (the Laplacian). For example, if F is a spin bundle over M, the Dirac 
operator D = i y^d/j + m : T(M, E ) -> T(M, E) acts on a section t//(x) of E as 

[D^m a = Hy 1 ") 01 (x) + mf a (x). 

The symbol of D is a k x k' matrix 

<r(D,§)= AMa A(x)^M (12.3) 

\M\=N 


where | is a real ///-tuple | = (fi,...,f m ). The symbol is also defined 
independently of the coordinates as follows. Let E — M be a vector bundle 
and let p e M, § e T*M and 5 e 1 Take a section s e T(M, E) such that 
s{p) = s and a function / e T(M) such that f(p) = 0 and d f(p) — $ e T*M. 
Then the symbol may be defined by 

a( D ^)s = ^E>(f N ~s)\ p . (12.4) 

The factor f N automatically picks up the /Vth-order term due to the condition 
f(p) = 0. Equation (12.4) yields the same symbol as (12.3). 



If the matrix a(D. f ) is invertible for each x e M and each ? e K'” — {0}, 
the operator D is said to be elliptic. Clearly this definition makes sense only when 
k — k' . It should be noted that the symbol for a composite operator D — D\ o £>2 
is a composite of the symbols, namely a(D, ?) = o(D\, ?)o(D 2 , ?)■ This shows 
that composites of elliptic operators are also elliptic. In general, powers and roots 
of elliptic operators are elliptic. 


Example 12.1. Let x 1 ' be the natural coordinates in W" . If E and F are real line 
bundles over M'” , the Laplacian A : T (TR m ,£■)—> r(K"' , F ) is defined by 


A = 


d(x m ) 


,2 • 


dix 1 ) 2 

According to (12.3), the symbol is 

a(A,§) = £(^) 2 . 

This is in agreement with the result obtained from (12.4), 
a(A. { ,s = 1a ( /^)|, = 1 £ lr 


(12.5) 


3/ 35 


= '-(fA i + 2fAr s + 2fZ^ ax , 
= £(^) 2 *- 


.v- 9 / 3 / 

-2J2——S 


dx^ dx ^ 


This symbol is clearly invertible for £ ^ 0, and hence A is elliptic. 
However, the d’Alembertian 


□ = 


dix 1 ) 2 

is not elliptic since the symbol 


+ ••• + 


dix™- 1 ) 2 d(x m ) 2 


(12.6) 


( n ,$) = ($ 1 ) 2 + --- + ($ m ~ 1 ) 2 -($ m ) 


i-m\ 2 


vanishes everywhere on the light cone, 

(? m ) 2 = (f x ) 2 + b (| m_1 ) 2 . 

Exercise 12.1. Let M — K 2 and consider a differential operator D of order two. 
The symbol of D is of the form 

cr(D,?) = An? 1 ? 1 +2A 12 ? 1 ? 2 + A 22 ? 2 ? 2 . 


Show that D is elliptic if and only if a(D. ?) — 1 is an ellipse in £-space. 



12.1.2 Fredholm operators 


Let D : F (M, E) -> V (M . F ) be an elliptic operator. The kernel of D is the set 
of null eigenvectors 

kerD = {s e T{M, E)\Ds = 0}. (12.7) 

Suppose E and F are endowed with fibre metrics, which will be denoted ( , )e 
and ( , ) f, respectively. The adjoint D ' : T(M, F) — >■ T(M, £) of D is defined 
by 

(/,Os)f = (dV,j) e ( 12 . 8 ) 

where .v e T(M, E) and ,y' e T(M, F). We define the cokernel of D by 

coker Z) = F(M, F)/imD. (12.9) 

Among elliptic operators we are interested in a class of operators whose 
kernels and cokernels are finite dimensional. An elliptic operator D which 
satisfies this condition is called a Fredholm operator. The analytical index 

ind D = dim ker D — dim coker D (12.10) 

is well defined for a Fredholm operator. Henceforth, we will be concerned only 
with Fredholm operators. It is known from the general theory of operators that 
elliptic operators on a compact manifold are Fredholm operators. Theorem 12.1 
shows that ind D is also expressed as 

ind D = dim ker D — dim ker D 1 . (12.11) 

Theorem 12.1. Let D : T(M, E) -> F (M, F) be a Fredholm operator. Then 

coker D = kerZ^ e{j£ T(M, F)|D t .y = 0}. (12.12) 


Proof. Let [.v] e coker D be given by 

[s] = {s' e r (Af, F)|s' = s + Du, u e T(M, E)}. 

We show that there is a surjection ker D' — > coker D, namely any [.v] e coker I) 
has a representative .vo e ker D ' . Define so by 

s 0 = s-D—^- D f s. (12.13) 

D f D 

We find so e ker D' since D^sq — D^s — D* D(D* D)~ l D 1 s = D' s — D' s — 
0. Next, let so,s' 0 e kerD’ and so ^ ■Sq' s h° w that [^o] 7 ^ iTo] ' n 
T (M, F)/imD. If [so] = [ sq ], there is an element u e T(M, E) such that 
so — Sq = Du. Then 0 = (n, D' (so — s' q ))e — (n, D' Du)e = {Du, Du)p > 0, 
hence Du = 0, which contradicts our assumption so ^ sk . Thus, the map 
so 1 -^ [s] is a bisection and we have established that coker D = ker D * . □ 



12.1.3 Elliptic complexes 

Consider a sequence of Fredholm operators, 

► r (M, r) r(M, £,■) A r(M, E i+ 1) ^ (12.14) 

where {£, } is a sequence of vector bundles over a compact manifold M. The 
sequence (£)•, D ( ) is called an elliptic complex if /), is nilpotent (that is 
DjoDj-i — 0) for any i. The reader may refer to T(M, £)■) = S2,-(Af) and D, = d 
(exterior derivative) for example. The adjoint of D, : T(M, £, ) — > T(M, £)+i) 
is denoted by 

£>• : T (M, E;+i) -> F(M, £,-)• 

The Laplacian A, : T(M, /;, ) —> F(M, £, ) is 

A, = A-iDVi +£>JA. (12.15) 

The Hodge decomposition also applies to the present case, 

si = Di-ist-i + D' is i+ 1 + /t, (12.16) 

where .v,±i e F(M, £)±i) and hi is in the kernel of A/, A,7z; = 0. 

Analogously to the de Rham cohomology groups, we define 

H‘(E, D) = ker A/imA-t- (12.17) 

As in the case of the de Rham theory, it can be shown that H' (E, D) is isomorphic 
to the kernel of A/. Accordingly, we have 

dim H l {E, D) — dim Harm' (£\ D) (12.18) 

where Harm' ( E . D) is a vector space spanned by {/z, }. The index of this elliptic 
complex is defined by 

m m 

ind D = £](—iy dim //'(£, D) = ^(-1)' dimker A,. (12.19) 

i=0 i=0 

The index thus defined generalizes the Euler characteristic, see example 12.2. 

How is this related to (12.10)? Consider the complex F(M, E) Y (M . F). 
We may formally add zero on both sides, 

o4r(M,£)4r(M,f)^o ( 12 . 20 ) 

where i is the inclusion. The index according to (12.19) is 


dim ker I) — {dimr(M, F) — dimimZ)} = dim ker I) — dim coker/1 



where we have noted that dimim; = 0, kenp = T(M, F) and coker/) = 
kerip/ im D. Thus, (12.19) yields the same index as (12.10). 

It is often convenient to work with a two-term elliptic complex which has the 
same index as the original elliptic complex ( E , D). This rolling up is carried out 
by defining 

E+ = @E2r, £- = © E 2r +\ (12.21) 

r r 

which are called the even bundle and the odd bundle, respectively. 
Correspondingly we consider the operators 

A = @{D 2r + £» t 2r-l), = ®(D 2r+ i + D' lr ). (12.22) 

r r 

We readily verify that A : T(M, E + ) — >■ T(M, £_) and : T(M, £_) — >■ 
T (M, E+). From A and ,4 f , we construct the two Laplacians 

A + = A 1 A = ®{D 2r +i + D' 2r )(D 2s + D^ 2 s -i) 

r,s 

— ®(D 2r -\ IF 2r— I + D i 2rD 2r ) = © A 2r 

r r 

A_ = AA f = 0 A 2 , + i. 

r 

Then we have 

ind(£±, A) = dimker A + — dimker A_ 

= ^(-l) r dimker A, = ind(£, D ). (12.24) 

Example 12.2. Let us consider the de Rham complex Q(M) over a compact 
manifold M without a boundary, 

0 4 Q°(M) 4 Q 1 (M) 4 i £2 m (M) 4 0 (12.25) 

where m = dimM and d stands ford,. : Q. r (M) —> ^ r+1 (M). H r (E, D) defined 
by (12.25) agrees with the de Rham cohomology group H r (M, Iff). The index is 
identified with the Euler characteristic, 

m 

ind(fi*(M),d) = ^(-l) r dim H r (M; Iff) = /(M). (12.26) 

r = 0 


(12.23a) 

(12.23b) 


We found in chapter 7 that b r = dim H r (M.W) agrees with the number of linearly 
independent harmonic r -forms: dim H r (M, Iff) = dim Harm'' (M) = dimker A,, 
where A,- is the Laplacian 


A,. = (d+ d f ) 2 = d,-id T ,._ i + d f r d r 


(12.27) 


dj : £2' +1 (M) — > Q, 1 ' (M) being the adjoint of d,-. Now we find that 


;(M) = ^(-l)'dimker A,. 


(12.28) 


This relation is very interesting since the LHS is a purely topological quantity 
which can be computed by triangulating M, for example, while the RHS is given 
by the solution of an analytic equation A, n = 0. We noted in example 11.6 that 
/(M) is given by integrating the Euler class over M: /(M) = f M e(T M). Now 
(12.28) reads 


rn „ 

T(-iy dimker A,- = / 

Jm 


e(T M). 


(12.29) 


This is a typical form of the index theorem. The RHS is an analytic index while 
the LHS is a topological index given by the integral of certain characteristic 
classes. In section 12.3, we derive (12.29) from the Atiyah-Singer index theorem. 
The two-term complex is given by 


G + (M) = © Qr r (M) Q~(M) = ®Q, 2r+l (M). 


(12.30) 


The corresponding operators are 


A = ®(d2,- + d’ 2 ,- 1 ) A* = ®(d2r-l + d' 2 , 0 . 


(12.31) 


It is left as an exercise to the reader to show that 


indt^lM), A) = dimker A + — dimker A_ = /(M). 


(12.32) 


12.2 The Atiyah-Singer index theorem 
12.2.1 Statement of the theorem 

Theorem 12.2. (Atiyah-Singer index theorem) Let ( E,D ) be an elliptic 
complex over an m -dimensional compact manifold M without a boundary. The 
index of this complex is given by 

ind(E, D) = (-l)"(”+D/2 [ C h( ®(-D r £ r ) ' (12J3) 

Jm v '■ > e(TM) vol 

In the integrand of the RHS, only m-forms are picked up, so that the integration 
makes sense. [ Remarks : The division by e(T M) can really be carried out at the 
formal level. If m is an odd integer, the index vanishes identically, see below. 
Original references are Atiyah and Singer (1968a, b), Atiyah and Segal (1968).] 

The proof of theorem 12.2 is found in Shanahan (1978), Palais (1965) and 
Gilkey (1984). The proof found there is based on either K -theory or the heat 



kernel formalism. In section 13.2, we give a proof of the simplest version of the 
Atiyah-Singer (AS) index theorem for a spin complex. Recently physicists have 
found another proof of the theorem making use of supersymmetry. This proof is 
outlined in sections 12.9 and 12.10. Interested readers should consult Alvarez- 
Gaume (1983) and Friedan and Windey (1984, 1985) for further details. 

The following corollary is a direct consequence of theorem 12.2. 

Corollary 12.1. Let T(M, E) 4 T(M, F) be a two-term elliptic complex. The 
index of D is given by 

ind D — dim ker D — dim ker D ' 

= (_1 yn(m+V)l2 [ ( c hff-chF) Td(rM } . (12.34) 

Jm e(T M) vol 

12.3 The de Rham complex 

Let M be an m -dimensional compact orientable manifold with no boundary. By 
now we are familiar with the de Rham complex, 

i f2' -1 (M) C 4 £/(M) C 4 Q r+l (M) C 4 ••• (12.35) 

where £2' (M) c = T(M, A r T*M c ). We complexified the forms so that we may 
apply the AS index theorem. The exterior derivative satisfies d 2 = 0. To show 
that (12.35) is an elliptic complex, we have to show that d is elliptic. To find the 
symbol for d, we note that 

o'(d,?)oj = d(fs)\ p = d/As + fds\ p = § A to 

where p e M, a> e £2' p (M) c , f(p) — 0, d f{p) = ?,s e £2 r (M) c and 
s{p) = u>\ see (12.4). We find 

cr(d,§) = §A. (12.36) 

This defines a map £2 r (M) c -> £2' + 1 (M) c and is non-singular if § / 0. 
Thus, we have proved that d : £2 r (M) c — »■ £2' +1 (M) c is elliptic and, hence, 
(12.35) is an elliptic complex. Note, however, that the operator d : Q k (M ) -> 
£2 k+1 (M) is not Fredholm since kerd is infinite dimensional. To apply the index 
theorem to this complex, we have to consider the de Rham cohomology group 
H 1 (M) instead. The operator d is certainly Fredholm on this space. 

Let us find the index theorem for this complex. We note that 
dime H r (M\ C) = dinip : H r (M; IK). Hence, the analytical index is 

m 

indd = ^(— l) r dim c //' (M; C) 

r = 0 

= l) r dim W) = x(M) 


(12.37) 



where y (M) is the Euler characteristic of M. Suppose M is even dimensional, 
m — 21. The RHS of (12.33) gives the topological index 


(_ 1 )*( 2 /+ 1 ) 



© (-l) r A r T*M C 

r = 0 


Td (TM C ) 
e(T M) 


vol 


The splitting principle yields 


(12.38) 


chf © (-iy A r T*M C ) 

V= 0 / 

= 1 - ch (T*M C ) + ch(A 2 T*M c ) + • • • + (-l)'"ch(A m T*M C ) 

m 

= 1 - e_x ‘ (TM C ) + e_x ‘ e ~ Xj ( TM C ) + ■■■ 

i=l i<j 

+ (—l) m e~ Xl e ~ X2 . .. e _, "(rM C ) 

m 

= ]~ [(1 - e~ Xi )(TM C ) 

i=i 


where we have noted that Xj(T*M c ) — —Xj(TM c ). [Let L be a complex line 
bundle and L* be its dual bundle. L <S> L* is a bundle whose section is a map 
C -* C at each fibre of L. L <g> L* has a global section which vanishes nowhere 
(the identity map, for example) from which we can show L®L* is a trivial bundle. 
Wehaveci(L®L*) = c\{L) + c\(L*) — 0, hence x(L*) — —x{L). The splitting 
principle yields Xj{T*M c ) = —Xj(T M c ).] We also have 

m 

Td (TM C ) = Ff — — (TM C ) 

1 I 1 _ Q~Xi 

i = 1 
l 

e(T M) = Y\x>(TM c ). 

i = 1 


Substituting these in (12.38), we have 

indd = J (-l) l{V+l) (-l) l (j~[xi(TM C )j = J e(T M). 

If m is odd, it can be shown that (Shanahan (1978), p22) 


ind d = 0 


(12.39) 


(12.40) 


which is in harmony with the fact that e{T M) — 0 if dim M is odd. In any case, 
the index theorem for the de Rham complex is 

X (M)= [ e{TM). 

Jm 


(12.41) 



Example 12.3. Let M be a two-dimensional orientable manifold without 
boundary. Equation (12.41) reads 


X(M) = -L f e «Pn ap = ^~ [ IZu (12.42a) 

47T in Jm 

which is the celebrated Gauss-Bonnet theorem. For dim M = 4, it reads as 

xm = ^2 f A -R rS . (12.42b) 

12.4 The Dolbeault complex 

We recall some elementary facts about complex manifolds (see chapter 8 for 
details). Let M be a compact complex manifold of complex dimension m without 
a boundary. Let z 1 ' = x 1 ' + i>' ,/ be the local coordinates and z. p = x p — \y p 
their complex conjugates. T M + denotes the tangent bundle spanned by { d/dz p } 
and T M~ = T M + the complex conjugate bundle spanned by {d/dzl 1 }. The 
dual of T M + is denoted by T*M + and spanned by {dz M } while that of T M~ is 
T*M~ = T*M + spanned by { dz /; } - The space ' of complexified /--forms 

is decomposed as 

£2 r (M) c = © £2 p - q (M) 

p+q=r 

where Q pq (M) is the space of the ( p. //(-forms, which is spanned by a basis of 
the form 

d A ... A d z Pp A dz 1 ’ 1 A ... A d z Vq . 

The exterior derivative is decomposed as d = 3 + 3 where 


(12.43a) 
(12.43b) 

(12.44) 

To show that (12.44) is an elliptic complex, we compute the symbol for 3. 
Let § = § 0-1 + § 1,0 be a real one-form at p e M, where £ 0,1 e and 

|i.o = |oT e £2p°(M). 


3 = dz p A 3/3 z 11 3 = dz. p A 3/3 z p . 

They satisfy 33 + 33 = 3 2 = 3 2 = 0. We have the sequences 

i Q, p,q (M) 4 £2 p ’ q+l (M) 4 ••• 

i Q, p,q {M) 4 £2 p+hq (M) 4 ■ ■ • . 

We are interested in the first sequence with p = 0, 

i n°’ q (M) 4 n°- q+l (M) 4 • ■ ■ . 

This sequence is called the Dolbeault complex. 


Take an anti-holomorphic / -form co e Q <]J (M ). We find 

a(3, $)co = d (fs) = 3/ A S + /3s | p = I 0 ' 1 A ® 
where f(p) = 0, 3 f(p) = £ 0,1 , s e Q°’ r (M) and s(p) = co. We have 

er(3, £) = | 0,1 A . (12.45) 


From a similar argument to that given in the previous section, it follows that the 
symbol (12.45) is elliptic. Thus, the Dolbeault complex (12.44) is an elliptic 
complex. 

The AS index theorem takes the form 


ind 3 = 



(-1/ A r T*M~ 


Td(TM c ) 
e(T M) 


vol 


(12.46) 


The LHS is computed as follows. We first note that 
ker 3,/im3,_i = H°' r (M ) 

where H 0 r (M ) is the 3-cohomology group. Then the LHS is 


ind 3 = J](-l ) r b°’ r 

r = 0 


(12.47) 


where b t)j = dini j II t< r (M ) is the Hodge number. This index is called the 

arithmetic genus of M. 

Simplification of the topological index can be carried out as in the case of 
the de Rham complex. We refer the reader to Shanahan (1978) for the technical 
details. We have 


r= 1 



Td(TM+) 


(12.48) 


where Td(T M + ) is the Todd class of T M + . 


12.4.1 The twisted Dolbeault complex and the Hirzebruch-Riemann-Roch 
theorem 

In the Dolbeault complex, we may replace Q (l ' r (M) by the tensor product bundles 
£2°’ r (M) <g> V , where V is a holomorphic vector bundle over M, 

■■■ ® V X Q.°' r (M) ® V ^ ■ ■ ■ . (12.49) 

The AS index theorem of this complex reduces to the Hirzebruch-Riemann- 
Roch theorem, 

ind 3y = / Td(TM + )ch(F). 

Jm 


(12.50) 



For example, if m — dimr; M — 1 , we have 


ind 9y = ^ dim L f a(TM + )+ f ci(V ) 

Jm Jm 

r i S' 

= (2 - g) dim V + / — 

Jm 

since it can be shown that 

f c\(TM + ) — [ e(TM) = 2-g 
Jm Jm 


g being the genus of M. 


(12.51) 


12.5 The signature complex 
12.5.1 The Hirzebruch signature 

Let M be a compact orientable manifold of even dimension, m = 21. Let [of] and 
[rj] be the elements of the ‘middle’ cohomology group / / / ( M : M). We consider a 
bilinear form H\M\ M) x H'(M\ M) — >■ K defined by 

[ «A»7 (12.52) 

Jm 

cf example 11.8. This definition is independent of the representatives of [o>] and 
[??]. The form a is symmetric if l is even (m = 0 mod 4) and anti-symmetric if 
/ is odd (in = 2 mod 4). Poincare duality shows that the bilinear form cr has the 
maximal rank b 1 = dim H‘(M; TO) and is, hence, non-degenerate. If / = 2k is 
even, the symmetric form cr has real eigenvalues, b + of which are positive and b~ 
of which are negative (b + + b~ — b '). The Hirzebruch signature is defined by 

r(M) =b+ - b~. (12.53) 

If l is odd, r (M) is defined to vanish (an anti- symmetric form has pure imaginary 
eigenvalues). In the following, we set / = 2k. 

The Hodge * satisfies * 2 = 1 when acting on a 2/M'orm in a 4A'-dimensional 
manifold M and hence * has eigenvalues ±1. Let Harnr^LV/) be the set of 
harmonic 2A'-forms on M. We note that Harm 2k (M) = H 2k (M\ IR) and each 
element of H 2k (M\ IR) has a unique harmonic representative. Harm 2k (M) is 
separated into disjoint subspaces, 

Harm 2 k (M) = Harm^'(M) © Harmi A '(M) (12.54) 

according to the eigenvalue of *. This separation block diagonalizes the bilinear 
form cr . In fact, for co^ 1 e Harm^ (M), 

a(a> + ,a> + )= I co + A m + = I co + A *co + = (a> + , a> + ) > 0 
Jm Jm 



where (o> + , w + ) is the standard positive-definite inner product defined by (7.181). 
We also find 

a(co~ . a>~) = — I a>~ A *a>~ — —(ft) - , u>~) < 0 
Jm 

a(ft) + , ft; - ) — — I ft> + A *co~ = — I u>~ A *&) + = — a(ft) + , o )~ ) = 0 
Jm Jm 

where we have noted that a A */8 = A *a for any forms a and /l. Hence, 
a is block diagonal with respect to Harm^’(M) © Harnr^(M) and, moreover, 
b ± = dimjj Harm^ fM). Now r(M) is expressed as 

r(M) = dimHamrf (M) - dimHarnr A (M). (12.55) 

Exercise 12.2. Let dimM = 4k. Show that 

r (M) = x (M) mod 2. (12.56) 

[Hint: Use the Poincare duality to show that x(M) — b 2k mod 2.] 

12.5.2 The signature complex and the Hirzebruch signature theorem 

Let M be an m -dimensional compact Riemannian manifold without a boundary 
and let g be the given metric. Consider an operator 

2) = d + d f . (12.57) 

2) is a square root of the Laplacian: 2) 2 = dd : + d ; d = A. To show that 2) 
is elliptic, it suffices to verify that A is elliptic since the symbol of a product of 
operators is the product of symbols. Let us compute the symbol of A. As for d, 
we have er(d, §)&> = £ A w. As for d* , it can be shown that (Palais 1965, pp77-8) 

cr(d t ,f) = -!>. (12.58) 

Here i ^ : f2' p (M) — > £2 r ~ l (M) is an interior product defined by (cf. (5.79)) 

i|(dx M1 A ... A d.U'"') 

r 

= ^(-l)- /+ 1 g M -' ,t ^ dx ^ 1 A ... A dx'N A ... A dx Mr 
;=i 

where the one-form under ' is omitted and we put £ = | /x dx /x . Now the symbol 
of the Laplacian is obtained from (12.58) as 

ct(A, £)&> = a(dd' + d’d, f)&> = — [£ A i^(a>) + /'j(| A &>)] 

= - l|(£) Aft) = -||§|| 2 ft) 



where a> is an arbitrary /--form and the norm || || is taken with respect to the given 
Riemannian metric. Finally, we obtain 

a(A,f) = -||f|| 2 . (12.59) 

Thus, the Laplacian A is elliptic and so is D — d + d 1 . 

Since the Laplacian A = ID 2 is self-dual on the index of A vanishes 

trivially. It is also observed that D = ID' on Q*( M ) and, hence, ind D — 0. 
To construct a non-trivial index theorem, we have to find a complex on which 

Exercise 12.3. Consider the restriction D e of D to even forms, D e : Q L '(M ) -> 

f2°(M) c where £2 e (M) c = ©£2 2, '(M) C and Q°(M) C = ®Q 2i+l {M) c . The 
adjoint of D e is D° = D e ^ : f2°(M) c -> £2 e (M) c . Show that 


indD e = dimkerD 6 — dimkerD 0 = /(M). 


[Hint: Prove kerD e = ©Harm 2, (M) and kerlD 0 = ©Harm 2,+1 (M). This 
complex, although non-trivial, does not yield anything new.] 

If dimM = m = 21, we have * * ri — (— 1 )' rj for rj e f2'(M) c . We define 
an operator 7T : £2 r (M) c -» Q, m ~ r (M) c by 


it = i r( '' l),/ * . 


(12.60) 


Observe that tt is a 'square root’ of (— l) r * * = 1 . In fact, for to e Cl 1 (M) c , 

n 2 U = i r(r - l)+, Jl(*CO) = i r(r-l) + lM2l-r)i2l-r-l)+, * ^ 

= = ( — iy * = to (12.61) 

where we have noted that r = r 2 mod 2. We easily verify (exercise) that 


{7T, D} = 7tD + Dtt = 0. (12.62) 

Let it act on = ©£2 r (M) c . Since it 2 — 1, the eigenvalues of n are ±1. 

Then we have a decomposition of £2*(M) c into the ±1 eigenspaces of j r 

as 

= S2+(M) © (12.63) 

Since D anti-commutes with 7t, the restriction of D to Q + (M ) defines an elliptic 
complex called the signature complex, 

D+ : £2+(M) -> Q.~(M) (12.64) 


where D + = ID|£ 2 +(M)- The index of the signature complex is 


ind ID -|_ = dimkerlD-|_ — dimkerlD- 

= dimHarm(M) + — dimHarm(M) 


(12.65) 



where ID_ = : £2~(M) — >■ Q, + (M) andHarmlAf^ = {a> e £2 ± (M)|ID±&> — 

0}. On the RHS of (12.65), all the contributions except those from the harmonic 
/-forms cancel out. To see this, we separate ker D + and ker2)_ as 

ker2)± = Harm '(AT)* © ^ [Harm' (M)* © Harm'" - ' (M)^ 

0 <r<l 

where Harm' (M)^ = Harm(/V/ ) ± n Q r (M). If o> e Harm'(M), we have 
coJzkco e Harm' (M) 1 * 1 ©Harm'" - ' (M)*. Then a map co+nco — > co—ttco defines 
an isomorphism between Harm '(M) + © Harm'" - '(M) + and Harm r (M)~ © 
Harm'" - ' (M) - . Now the index simplifies as 

ind D + = dim Harm 2/ '(/W ) + — dimHarm 2k (M)~ (12.66) 

where we put I — 2k as before (the index vanishes if / is odd). It is important to 
note that Harm 2A (/V/) ± = Harm^'(M) since tt = * in Harm 2k (M), see (12.54). 
Now the index (12.66) reduces to the Hirzebruch signature, 

ind2)+ = r(M). (12.67) 


The derivation of the topological index is rather technical and we simply 
quote the result from Shanahan (1978). Let A 1 * 1 T*M c be the subspace of A 
such that — T(M, A ± 7’*A/ c ). Then we have 


Td( T M ' ) 

topological index = (-1)' / ch(A +T*M C - A ~T*M C ) — 

Jm e(TM ) 


= 2 l 


»'/ 

Jk 

f rr x i/ 2 

‘ M i =l 


vol 


tanh Xj / 2 


vol 


/„ri 

Jm . =1 


tanh xi 


vol 


where the last equality is true only for the 2/-forms in the expansion and X[ = 
Xi(T M c ). Now we have obtained the Hirzebruch signature theorem 

r (M)= [ L(TM) | voi (12.68) 

Jm 

where L is the Hirzebruch L-polynomial defined by (11.91). Since L is even in 
Xi, x (M) vanishes if m = 2 mod 4. For example, r (M) = 0 for m — 2. If m = 4, 
we have 

r(M) = I Pl (TM ) = -^5 J tr 1Z 2 . (12.69) 

As in the case of the Dolbeault complex, we may twist the signature complex, 
see Eguchi et al (1980), for example. 


12.6 Spin complexes 

The final example of classical complexes is the spin complex. This complex is 
very important in physics since it describes Dirac fields interacting with gauge 
fields and/or gravitational fields. 



12.6.1 Dirac operator 


Let us consider a spin bundle S(M) over an w-dimensional orientable manifold 
M. We shall denote the set of sections of this bundle by A (M) — V (M . S(M)). 
We assume that m — 21 is an even integer. The spin group SPIN(;«) is generated 
by m Dirac matrices {y 01 }, which satisfy 


= y a 


{y a ,y fi } = 28 afl . 


(12.70a) 

(12.70b) 


Throughout this chapter we assume that the metric has the Euclidean signature. 
The Clifford algebra is generated by 

1; y a ; y ai y a2 (at <o 2 );...; 

y ai ... y ak (on < ... < a k )\ . . . ; y 1 . . . y 21 . 

The last generator is of particular importance and we define 


y m+ 1 = i l y l 


■ r 


(12.71) 


Our convention is such that ( y m+l ) 2 = I and (y m + 1 )t = y m + ] . [t can be shown 
from the general theory of the Clifford algebra that the y x are represented by 
2 l x 2 l matrices with complex entries. It is convenient to take a representation of 
{y x } such that y m+l is diagonal. 


where 1 here is the 2 1 1 x 2' 1 unit matrix. 


y m+l = 


■>1-1 


1 0 

0 -1 


(12.72) 


Example 12.4. For m = 2, we take 
y° = er 2 / = 
o a being the Pauli matrices, 


o\ 


(Tl 


i) 1 
1 0 


Oi — 


For m = 4, we may take 
Y P = 


0 

- ia E 


ia h 


y 3 = iyV 


i 0 


03 — 


03 


a & — ( I 2 , —iff), a“ = (h. iff) 


y 5 = -y°y 1 y 2 y 3 = 


h 0 
0 — 1 2 



A Dirac spinor i Js e A (M) is an irreducible representation of the Clifford 
algebra but not that of SPIN(2/). Irreducible representations of SPIN(2/) are 
obtained by separating A (M) according to the eigenvalues of y 1,1+1 . Since 
(y m+ 1 ) 2 = /. the eigenvalues of y m+1 , called the chirality, must be ±1. Then 
A (M) is separated into two eigenspaces 

A(M) = A+(M) © A“(M) (12.73) 

where y m+l i/r ± — for i/ f± e A ± (M). The projection operators V ± onto 

A* are given by 

V+ = ] -(l + y' n+l ) = ( J o ) (12.74a) 

V~ ee 1(7 - y m+l ) = ([j J ) ■ (12.74b) 

Thus, we may write 1 

y + = ( X ) e A+(M)> = ( *- ) e A_(M) ' (12-75) 

The reader should verify that V + + V~ — 1, ("P 1 * 2 ) 2 — ’P ± ,V + V~ = 
0, V ± \lf ± = ijr* and V ± f T = 0. 

The Dirac operator in a curved space is given by (section 7.10) 

Wf = iy^d/dx^f = iy M (3/r + (12.76) 

where co ^ — jico^Hafi is the spin connection and = y 0l e a 11 - Let us prove 
that iy is elliptic. Let / be a function defined near p e M such that f(p) — 0 

and iy^d^fip) — i y^l-^ = i$. 2 Take a section i/f e A (M) such that ijr(p) — ir. 

From (12.4), we have 

CT (iy, t-w = xU(fh\ P = <&m\ P = 'm 

which shows that 

<r(iy,*) = 4f. (12.77) 

If we note that $$ = ^ a ^py a y^ = we find that (12.77) is invertible for 

i|f ^4 0, hence iy is an elliptic operator. 

It can be shown that {y“ \ is taken in the form 

7 ) (12 - 78) 

1 Note the minor abuse of the notation. 

2 For a vector A = A^e^, fl. denotes y^A^. 



see example 12.4 for m — 2 and 4. Then (12.76) becomes 


0 D* 
D 0 


where 


D = a^e^id^ + co^) D f = -a^e^ (9 M + co ^). 


(12.79) 


(12.80) 


Hence, D' is, indeed, the adjoint of D (note that 3 /t + &j /; is anti-Hermitian). For 


e A + (M) 


we have 


while for 


we have 


0 D f 
D 0 


e A ~(M) 


Hence, D = ifV + : A +(M) -> A“(M) and = ifV~ 
A + (M). Now we have a two-term complex 


A ~{M) 


A +(M) A~(M) 


(12.81) 


called the spin complex. The analytical index of this complex is 


ind D = dim ker D — dim ker D 1 


v+ — V- 


(12.82) 


where v+ (v_) is the number of zero-energy modes of chirality + (— ). 

Let us apply the AS index theorem to this case. Without getting into the 
details of the Clifford algebra and the spin complex, we simply write down the 
result. The AS index theorem for the spin complex (12.81) is 


v+-V-= / ch(A+(M) - A“(M)) 

J M 


Td(TM c ) 

e(TM) 


A(TM) |voi 


(12.83) 


where A is the Dirac genus defined by (1 1.94). Since A contains only 4 /-forms, 
v + — V- vanishes unless m — 0 mod 4. Of course, this does not necessarily imply 
v+ = V- = 0. The proof of (12.83) will be given later in sections 12.9 and 12.10. 



12.6.2 Twisted spin complexes 


In physics, a spinor field may belong to a representation of a group G. For 
example, the quark field in QCD belongs to the 3 of SU(3). A spinor which 
belongs to a representation of G is a section of the product bundle S(M) ® 
E, where E is an associated vector bundle of P(M, G) in an appropriate 
representation. The Dirac operator De : A + (M) <g> E — > A (M)~ <S) E in this 
case is 

D e = i y + (o^ + A, l )V + (12.84) 

where A is the gauge potential on E. The AS index theorem for this twisted spin 
complex is 

v+-v-= [ A(T M)ch(£)|voi- (12.85) 

Jm 

For dim M = 2, we have 

v+-v_=/ chi (E)=2-f trT (12.86) 

Jm 2tt Jm 

while for dim M = 4, 


v+-v_= [ [ch 2 (£) + Ai(T M)ch 0 (E)] 
Jm 

dim E 

tr 3 r ~ H ^ 

JM 192tt 2 


-If 2 dim £ f 
8^ Jm 1 " + 192tt 2 J m ' 


tr 7? 2 . 


Example 12.5. Let 


M = T 21 = S 1 x • • • x S 1 . 


Then we find 


2/ times 

21 


A(T M) = a( © TS 1 ) = ]“[ A(T S' 1 ) 

1 l 


= 1. 


(12.87) 


We also have 4(7’ ,S' 2 ) = 1. Accordingly, the index of these bundles is 


u + — V- 



ch(£)| vo i. 


(12.88) 


Example 12.6. Let us consider the monopole bundle P{S 2 , U(l)). If A is the 
local gauge potential, the field strength is 3" = d.4. The index theorem is 


t>+ — V- 


— f T = — [ F 

2: x J s 2 2: r J s i 


(12.89) 


where 3" = i F . As was shown in section 10.5, the RHS represents the winding 
number tti(U( 1)) = Z and analytical information (the LHS) is now expressed in 
a topological way (the RHS). 



Let P(S 4 , SU(2)) be the instanton bundle. Expression (12.88) reads as 

v+ - v_ = [ ch 2 (30 =f\[ tr9“ 2 . (12.90) 

Js 4 8 tt- J s 4 

The RHS represents the instanton number A' e 7T3(SU(2)) = Z. Note that k > 0 
if T = while k < 0 if 9“ = — * 97 It can be shown that v_ = 0 (v + = 0) 
if k > 0 (k < 0), see Jackiw and Rebbi (1977). For example, let 9“ be self-dual. 
Suppose i/ r_ e kerD* = ker DD^ . From (12.80), we find that 

DD'f- = [0 /t + A, L ) 2 + 215^3^]^- = 0 

where = (l/4i)(a ,4 ci; 11 — Q , ''a ,4 ). It is easily verified that 5 ,lv is anti-self-dual 
(CT ,tv = — *CT ll/i ) and hence a^ v 3' /41 ’ = 0. Since (3^ +A lJL j 1 is a positive-definite 
operator, it has no normalizable bound states. This verifies that ker D 1 = 0. 

12.7 The heat kernel and generalized £ -functions 

As we mentioned in section 12.2, there are several methods of proving the AS 
index theorem. The heat kernel is relatively accessible to physicists and it also 
has many applications to other problems in physics. The generalized £ -function 
is related to the heat kernel and also has relevance in physics. 

12.7.1 The heat kernel and index theorem 

Let £ be a complex vector bundle over an m -dimensional compact manifold M. 
Let A : F(M, E) -> F(M, E) be an elliptic operator with eigenvectors | n) such 
that 

A\n)=X n \n). (12.91) 

We denote the set of eigenvalues of A by Spec A. We assume that A is non- 
negative, i.e. all the eigenvalues are non-negative. Suppose there are no modes 
1 0, i), 1 < i < no with vanishing eigenvalue. In other words, 

dimkerA = no. (12.92) 

These modes are called the zero modes. Define the heat kernel hit) by 

h(t) = e -,A . (12.93) 

It is convenient to represent h ( t ) in the coordinate basis as 

h(x,y\ t) = (x\h(t)\y) = (x\ ^ e _ ' A |n)(n|y) 

n 

= ^2e~ tXn (x\n)(n\y). 


(12.94) 



Multiple eigenstates should be counted as many times as they appear. We assume 
{x\n) is orthonormal: f (n\x){x\m)dx = S mn . The convergence of (12.93) for 
t > 0 is guaranteed since A is non-negative. Taking the limit t -» oo, we have 

» o 

lim h(x, y; t ) = V(x|0, i){ 0, t|y> (12.95) 

t — ^ OO ^ — ' 

i= 1 

where the summation is over the zero modes |0, i) only. Thus, h — e _rA tends to 
be the projection operator onto the space of zero modes as 

«o 

e _ fA t«o^ |0j| .)( 0j| .|_ ( 1 2 . 96 ) 

i=l 

Define 

h(t) = f h(x,x;t) dx = J2 e ~ a "- (12.97) 

J n 

Then it follows from (12.95) that 

no = lim h(t). (12.98) 

t — >oo 

It is easy to verify that h satisfies the heat equation, 

^ + A, W, y; f) = 0. (12.99) 

If A is the conventional Laplacian, (12.99) reduces to the ordinary heat equation. 
The initial condition is 

h(x, y; 0) = ^^(x\n) (n\y) — S(x — y) (12.100) 

n 

where the last equality follows from the completeness of the eigenvectors. 

Exercise 12.4. Let u(x, t ) be a solution of (12.99) such that u(x, 0) = u(x). Show 

that 

u(x,t) — J h(x, y; t)u(y) dy. (12.101) 

[Hint: First verify that (12.101) satisfies the initial condition, next that it is a 
solution of the heat equation.] 

It is known that the solution of (12.99) has an asymptotic expansion for 
t e given by 


h(x, x; e) = cij(x)e' 


( 12 . 102 ) 



see Gilkey (1984). Similarly, h(t) has an expansion 


h(e) = ^ ate' 
i 

where <7/ = f a, (x)dx. 

Let E and F be complex vector bundles over M and D 
T (M, F) be an elliptic operator. We define two Laplacians 

A E = D ] D : T (M, E) -> T(M, E) 

A F = : T(M, F) -* T(M, F). 


(12.103) 


T(M, F) -» 

(12.104a) 

(12.104b) 


It is important to note that they have the same non-vanishing eigenvalues 
including the degeneracy. To see this, let A /-; | a) = a|a). Then there is a vector 
D\X) e F(M, F) such that 

A F (F>|A.» = DD f D\X) = DA e \X) = X(D\X)). 


Note that D\X) ^ 0 since ker A F — ker D. Conversely, if | /x ) e ITT/. F) 
satisfies A//|/x) = /r |/r), then D 1 |/u) e T(M, E) is an eigenvector of A E with 
the same eigenvalue //. Thus, we have found the symmetry 3 

Spec' A E = Spec' A f (12.105) 

where the prime denotes that the zero eigenmodes are omitted. 

Define two heat kernels h E and h E by 

h E (x,y,t) = y^e - ^" {x\n){n\y) (12.106a) 

h E (x, y , t) — Y ^ m (x\m){rn\y). (12.106b) 

We have 


lim h E {t) — dimker A E — dimkerD (12.107a) 

/— >oo 

lim h F (t) — dimker Af = dimkerD*. (12.107b) 

t — > OO 

What is more interesting is the index of D. Since ker D — ker A E and ker D ' = 
ker A E , we have 

ind D — dim ker D — dim ker D 1 = dim ker A E — dim ker A E 

= lim [h E (t)-h F {tj]=h E {t)-h F (t). (12.108) 

t— >oo 


The final equality follows since the /-dependent part of h E (t) — h E (/) cancels out 
by the symmetry (12.105). We expand h E (t) and h E (/) as 

h E (t) = aft 1 h E (/) — "Y a f t‘ ■ 

This is a kind of ‘supersymmetry’, see section 12.10. 


3 



Picking up /-independent terms, we have 


ind D = Gq — Oq = j d.r [a F (x) — a F ( x )] dx (12.109) 

where a F ' F (x) are defined in (12.102). 

E F 

In general, a {) ’ (x) are local invariants written in terms of curvature two- 
forms. In section 13.2, we use the heat kernel to prove the index theorem 


ind D — v+ — V- = 



vol 


for the twisted spin complex over a manifold with A(T M ) = 1. 
Exercise 12.5. Let D , , A p and Ap be as before. Show that 


I(s) = tr 


s 

A e + s 


s 

Ap + s 


Res > 0 


is independent of s. Show also that I(s) — ind D. 


( 12 . 110 ) 


12.7.2 Spectral £ -functions 

Let E and F be vector bundles over M. Define a new function 

£ E(x,y;s ) = ^2 {x\n)(n\y)X~ s Re.? > 0 (12.111) 


where Ap\n) — /.„ \ri) and the prime denotes the omission of the zero modes 
(X n = 0). A function £ p (x . v ; .v ) may similarly be defined for A/-. The functions 
hp and O; are related by the Mellin transformation. To see this, we recall the 
definition of the T -function, 

/»oo /»oo 

T(j) = / r s_1 e _? d t = X s / t s ~ l e~ x, dt 

Jo Jo 

where X is taken to be strictly positive. From this we find 


r(s)C(x, y, s) = Y] / t s ~ l e~ Xnl {x\n){n\y)dt 

n 

n oo r 

= / t s ~ l h(x, y, t) — V'(x|0, r > <0, i\y) 
Jo L 


d t. (12.112) 


We also note that 


CaO) = 



£(x, x; s) dx — X 


—S 

n 


n 


(12.113) 



is the spectral £ -function defined in (1.158). 

Exercise 12.6. Verify that 

A-7(x) = j S(x,y;s)f(y)dy (12.114) 

where the general power of an operator may be defined in the sense of an 
eigenvalue, namely we put A _,s |n) = X~ s \n). Re s is assumed to be sufficiently 
large so that (12.114) is well defined. [Hint: Use the completeness of the 
eigenvectors.] 


Example 12.7. The following example is taken from Kulkarni (1975). Let M — 
■S 1 = {e 10 } and E — F = a trivial line bundle over S l (a cylinder). Take an 
operator A = — 3 2 /3 0 2 . From the eigenvalue equation. 


3 2 e i,!0 

dd 2 


= mV" 0 


n e Z 


we find that 

X„=n 2 (9\n) = (27r)“ 1/2 e i,!0 . 

The heat kernel is 


h{0i,e 2 \t)= ^]e-” 2f (0i|n>(n|0 2 ) 

1 + 


,in(6»i-02) 


while 

aei,e2\s) = Y^n- 2s [ei\n)[n\e 2 ) 

= — V , n- 2i e in(ei_02; 

2jt ' 

We easily verify that /z (r) = 1 + Yl' e _ " 2f satisfies 

/ oo _ roo 

e~ x ' dx < h (t) < 1 + 2 J e 
We then find from these inequalities that 

r+oo 2 _ r+oo 

/ e r dx — 1 < /z it) < e 1 ’dr + 1 
J — OO J - 


(12.115) 


(12.116) 


or by putting the value 


/e- dx = 1//2 



we find 

~Jjtt~ 1 ^ 2 — 1 < h(t ) < s/nt -1 / 2 + 1. 

This shows that 

lim h(t) ~ -Jnr 112 . (12.117) 

»-> 0 + 

In general, the asymptotic series starts with 

12.8 The Atiyah-Patodi-Singer index theorem 

So far we have been concerned with index theorems defined on a compact 
manifold without a boundary. In practical situations in physics, we often need 
to find an index of an operator defined over a base space M with a boundary. 
The extensions of the AS index theorem to these cases are discussed here. Our 
argument is restricted to the spin bundle over M since this is the only situation we 
shall be concerned with in chapter 13. 

12.8.1 //-invariant and spectral flow 

Let iy be a Hermitian Dirac operator defined on an odd-dimensional manifold M, 
dim M — 21 + 1 . Since iy is Hermitian, the eigenvalues 'kk are real. We define 
the //-invariant of iy by the spectral asymmetry of iy, 

1. (12.118) 

A.fc>0 

This is not well defined and requires a proper regularization. For example, we 
may define ;/ by lims-j.o bC 5 ) where 

r]{s) = ^'sgn(L fc )|^r 2 " Re i > 0. (12.119) 

k 

It can be shown that, under proper boundary conditions, // (,v) has no pole at .y = 0. 
Exercise 12.7. Use the Mellin transformation 

ir a“ ( " +1)/2 = J™ dxx s e~ ax2 a> 0 

to verify that 

, 1 (s) = — — / djtx s triye“*~ (iy) . (12.120) 

r(i(i + 1)) Jo 

Suppose a Dirac field is interacting with an external gauge potential A t , t e 
[0, 1]. The Dirac operator i ^ (A t ) has a /-dependent eigenvalue problem. If 
an eigenvalue of \f(A t ) crosses zero, the //-invariant jumps by ±2. This jump 



Figure 12.1. Whenever an eigenvalue X crosses zero (a), the ^-invariant jumps by ±2 (b). 
The sign depends on the way in which X crosses zero. 


denotes the spectral flow from X -s 0 modes to /. *5 0 modes; if rj jumps by 
+2 (—2). there is a flow of a state from X < 0 to X > 0 (A. > 0 to X < 0), see 
figure 12.1. In addition to the discontinuous change associated with the spectral 
flow, iy also has a continuous variation r], . We have 

r](t = 1 ) — ij(t = 0 ) = f d t—^~ + 2 x (spectral flow). ( 12 . 121 ) 

Jo d t 


12.8.2 The Atiyah-Patodi-Singer (APS) index theorem 

Let us consider a (21 + 2)-dimensional Dirac operator 


i£> 2 M -2 = ieri — + <72 < 8 > iVl A,) — 

at 


0 D \ 
D + 0 ) 


where 

D = id,- 1/(A t ) D f = id, + y (A,). 
[Remark: The positions of D and D 1 are reversed since 


y 2/ + 3 = 


-1 0 

0 1 


for our choice of /-matrices; cf (12.79).] 


( 12 . 122 a) 

(12.122b) 


Theorem 12.3. (Atiyah-Patodi-Singer theorem) Let M be an odd-dimensional 
manifold and i'f (A,) a Dirac operator on M interacting with an external gauge 
field A,. Then, 

ind D — dim ker D — dim ker D * 

= f A(^)ch(T)|vot - £[i?(iy(./li)) - ri(if(A Q ))]. 

JMxI 


(12.123) 



The general argument shows that the continuous part rj, of the ^-invariant 
satisfies 

f dt -j- = 2 [ A(^)ch(T)| vo i. (12.124) 

J 0 dt JMxI 

Then the RHS of (12.123) is simply the spectral flow 

1 1 dri c 

— [r](t = 1) — r](t = 0)] H — / dr = —spectral flow. 

2 2 Jo dr 

Thus, we find another expression for the APS index theorem, 

mdiDit+i = —spectral flow. (12.125) 

The proof of the APS index theorem in its most general form is found in Atiyah et 
al (1975a, b, 1976). The physicists’ proof is found in Alvarez-Gaume etal (1985). 
We use the APS index theorem to study the odd-dimensional parity anomaly in 
section 13.6. 


Example 12.8. To see why the spectral flow appears in the index theorem, we 
consider an example taken from Atiyah (1985). Let M = S l and 9 be its 
coordinate. Consider a Hermitian operator 


iV, = i 



— i3e + t 


t e ffi. 


(12.126) 


The term — ir is thought of as a U(l) gauge potential. The eigenvector and the 
eigenvalue of iV, are 

*n,tm = e Z) Xn(t) = n + t. 

\/2n 

Since Spec iV, = Spec iV, + i , the family of operators iV, is periodic in t with the 
period 1 , see figure 12.2. This periodicity manifests itself in the gauge equivalence 
of iV f and iV,+ i: 

iV m = e ie iV,e- ie . 

There is precisely unit spectral flow from /, < 0 to '/. > 0 at f = 0 while 1 changes 
from — e to 1 — e, e being a small positive number. From iV r , we construct a 
two-dimensional Dirac operator 

i0 2 = rn <S> ^ + <r 2 ® iV, = ^ (12.127a) 

where 

D = id, + dg-it D f = i9, - dg + it. (12.127b) 

These operators act on functions which satisfy the boundary conditions 

4>{0 + 2 tv, t) = <j)(9 , t) <p(9, t + 1) = e 10 <p(9. t ). 


(12.128) 


m 



Figure 12.2. Time evolution of the eigenvalues of iV r . SpeciV r has period 1. The ith 
eigenvalue crosses zero at t = 0 and, hence, there is a unit spectral flow. 


Let (po e ker D ' . We have a Fourier expansion 

<Po(0, t ) = 

It follows from I) ' (pa = 0 that 

a' n (t) + (n + t)a n (t) — 0 
which is easily solved to yield 


a„ (r) = c n exp | 


(n + t)- 


The boundary conditions (12.128) require that 
(n + t + l) 2 N 


c « ex p i 


e 11,8 — ^2 C n exp I 




from which we find that c„ is independent of n. Thus, ker /) ' is one dimensional 
and is spanned by the theta function. 


(n + t)- 


<fo(0, t) = J^exp 


2 


— inO 


(12.129) 



Suppose (f>o(6 , t) e ker D. If we put 4>o(9, t) — ^ b„(t)e ln6 , b„(t ) satisfies 

b' n (t) - (n + t)b n (t) = 0. 

The solution of this equation is 

(n + t) 2 

b„ ( t ) = b„ (0) exp 

and, hence, (po cannot be normalized. This shows that 

ind D — dim ker D — dim ker D 1 — — 1 
which agrees with —(spectral flow). 

12.9 Supersymmetric quantum mechanics 

We present, in the next section, the physicists’ proof of the index theorem in 
its simplest setting. The proof is heavily based on path integral formulation of 
supersymmetric quantum mechanics (SUSYQM), which will be outlined in the 
present section. 

We have studied the path integral quantization of bosons and fermions. 
If these particles are combined together, there appears a new symmetry called 
supersymmetry. We will introduce a special class of SUSYQM later, which 
turns out to be crucial in the proof of an index theorem. 

This and the next sections may be read separately from the previous sections. 
The necessary tools are supplied to make these sections self-contained. Our 
exposition follows Alvarez (1995) and Nakahara (1998). Original references are 
Alvarez-Gaume L (1983) and Friedan and Windey (1984, 1985). 


12.9.1 Clifford algebra and fermions 

We restrict ourselves to a particle moving in M 3 to start with. More general 
settings will be studied later. Let {ij/j} — {i/t, ^2, ^ 3 } be real Grassmann 
variables, where i = 1, 2, 3 labels the coordinate index. They satisfy the algebra 

Wi, ifj} = 0 

Let us consider the Lagrangian 

i . i 

L = -fifa - -< ij k Hi ip, 

where B, is a real number. The canonical conjugate momentum for 1 //, i 


m = 


9L 

Wi 



(12.130) 



Then the Hamiltonian is 


H = -i h^i'i ~ L = (12.131) 

The Poincare one-form of this system is 

6 = ^ i/, d\[/j . (12.132) 

The corresponding symplectic two-form is 

co = dd = ^ dfi A d ft (12.133) 

from which we obtain the Poisson bracket 

WjMrk]m=i8jk. (12.134) 

Quantization of the system is achieved by replacing this Poisson bracket by the 
anti-commutation relation 

= (12.135) 

This anti-commutation relation is called the Clifford algebra in IF':’ . Let a, be the 
r th component of the Pauli matrices. It is easily verified from the observation 

{(Tj , (Xk) = 28 jk 

that t/q = <7j /V 2 is the two-dimensional representation of the Clifford algebra. 
It is known that the finite-dimensional irreducible representation of the Clifford 
algebra is unique (modulo conjugate transformations). Thus, the Hilbert space of 
this system turns out to be T-L — C 2 . The Hamiltonian is rewritten in terms of the 
Pauli matrices as 

H = —\B ■ a . (12.136) 

This Hamiltonian is known as the Pauli Hamiltonian and describes a spin in a 
magnetic field. 

Similarly, the Clifford algebra defined in M 2 " and K 2 " +1 acts on the Hilbert 
space 'H = C 2 " . 

12.9.2 Supersymmetric quantum mechanics in flat space 

The Pauli Hamiltonian is made only of the spin coordinates i//, and is independent 
of the space coordinate Xk- Accordingly, it cannot describe a travelling spin. Now 
the Hamiltonian is modified so that the spin may move around the space. This can 
be realized by adding a kinetic term to the Hamiltonian. Let us consider a spin in 
W 1 and put B — 0 to obtain the Hamiltonian 

1 i 

L = -XkXk + Mk- 


(12.137) 



The coefficients of this Lagrangian have been chosen so that the system has a 
supersymmetry defined later. The canonically conjugate momenta are pk — Al- 
and 7tk = —iirk/2, from which we obtain the Poisson brackets of the system 

[Xj,Xk ] PB = [pj, Pk ] PB = o [Xj, pk ] PB = [fj, ^k ] PB = Sjk. 

It is easy to derive (anti)commutation relations from these Poisson brackets. The 
canonical (anti)commutation relations are 


[xj,Xk] = [pj, Pk] = 0 [Xj, Pk] = = Sjk- (12.138) 

The Hamiltonian is 

H = XjPj ~ ~ L = \p 2 = ~\ A (12.139) 

where A = Ylt=\ H ' s ^ lc ^-dimensional Laplacian. The Hilbert space on which 
H acts is L 2 ( E^) <g> C 2 " , where L 2 (ffi rf ) stands for the set of square-integrable 
functions in W and n = [ d /2] is the integer part of <//2. 

Variation of the Lagrangian yields 


SL = kj Jt 8xj + + 5^'df 5 ^- 


Let us verify that the Lagrangian is invariant under the following supersymmetry 
transformation 

Sxj — \e\j/j Si/fj — —exj (12.140) 

where e is an ‘infinitesimal’ real Grassmann constant. In fact, 

i . i 

SL = vcjeifj - -exjfj - -^jexj 

i . i d i . 

= vcjtfj ~ ifXjfj - - — (tj€Xj) + -ifjexj 

= (12 ' 141) 


and the action S — f Ldt is left invariant. The corresponding charge (the 
generator) is called the supercharge and defined through the Noether’s theorem 

4 

as 

eg = iepjfj = iexl/jpj — i efjXj. (12.142) 


Exercise 12.8. Show that 


8xj — [xj.eQ] 

Hj = {Vg,eg}. 


4 


Note that the mass of the particle is set to unity and hence we have pj = xj . 


(12.143) 

(12.144) 



These equations show that Q is the generator of SUSY transformations. 


Let us take d — 2 n to be an even integer and quantize the system in the 
following. We introduce the matrix representation i J/j — yj /\[2, which is the 
generalization of the two-dimensional representation introduced in the previous 
subsection. Here yj are the (/-dimensional Dirac matrices that satisfy the Clifford 
algebra 

[yj,y k } = 2Sij. (12.145) 

The Hamiltonian acts on the Hilbert space 

n = L 2 (K 2 ")®C 2 ". 

The supercharge takes the form, upon diagonalizing the coordinate, 

112 1461 

The operator 

t = Yj-Z~ (12.147) 

is nothing but the Dirac operator in Euclidean space M 2 " and plays an important 
role in the proof of the index theorem. 

The hypercharge Q transforms in an interesting way under an SUSY 
transformation (12.140) 

d 

SQ — i (Sx[rj)xj +i\jfj—8xj — i (-exj)xj + i%J/j(ie\J/j) 

= ~ KXjXj + efjtj = “ 2if (j-W + 

= — 2ieL. (12.148) 

Namely, the variation of the supercharge under an infinitesimal SUSY 
transformation is the Lagrangian ! 

We next consider the relation between the supercharge and the Hamiltonian 
of the system. Let us consider successive SUSY transformations with Grassmann 
parameters c\ and e 2 . If a transformation with e\ is applied first and then e 2 next, 
we obtain 

Xj xj + iet r/fj % xj + i(ei + e 2 )\J/j - let e 2 kj 
fj ^ ifj ~ ft Xj 3- fj - (ei + e 2 )xj - iei6 2 i j/j 
while if the order of the SUSY transformations is reversed, 

Xj Xj + i(ei + e 2 )\J/j - ic 2 eii ; - 
tj -> fj - (ei + e 2 )xj - i6 2 6i ij/ j . 



We find, from these results, the commutation relation of the SUSY variations: 


[<5 €2 , <5 ei ] = <5 0 <5 fl - 5 ei 5 e2 = -2iei62 — . (12.149) 

c it 

The observation that the commutation relation of two SUSY transformations 
is a time derivative, i.e. the Hamiltonian, suggests that the anti-commutation 
relation of the supercharge, the generator of the SUSY transformation, also yields 
the Hamiltonian. In fact, 

{Q,Q} = 2Q 2 = 2(ipj\lfj)(ip k \lr k ) 

— -pjPkifji'k + ^k^j) = -pjPkSjk 
= -2 H. 


After all, the SUSY algebra reduces to 

Q 2 = -H. (12.150) 

Since Q is anti-Hermitian, the Hamiltonian is a Hermite operator with non- 
negative spectrum. 

In summary, we proved in equations (12.148) and (12.141) that 

1 d<2 

8Q = -2ieL SL = -e — . (12.151) 

2 dr 

If these equations are compared with the SUSY transformations (12.140) of 
the coordinates xj and xjfj, we readily notice that the roles played by bosonic 
quantities (xj and L ) and the fermionic quantities (i jrj and Q) are interchanged. 
Note that the variation of the supercharge Q in (12.151) is always a time derivative 
of the Lagrangian L. This observation is crucial in constructing a SUSY-invariant 
Lagrangian out of a supercharge Q. 

12.9.3 Supersymmetric quantum mechanics in a general manifold 

Let M be a Riemannian manifold with dim M — 2n. The Riemannian metric is 

ds 2 = g^v dx M dx v 

and the inner product of two vectors X and Y with respect to this metric is denoted 
as 

(X, Y) = g„ v X»Y v . 

The vector ^(f) belongs to T M X ( t ) at each instant of time t. Therefore, 
i (r) obeys the ordinary transformation rule for a vector under the coordinate 
transformation — > x ,,i = x ffl (x v ): 



Then, under the SUSY transformation 8 = S € , the coordinates transform as 


8x = 


dx'^ 

dx v 


Sx v 


dx'P 

~8x ~ 


iei j/ v = iei 


and 


8f 


— 


d 2 X 

dx v dx x 
d 2 x^ 
dx v dx x 


8x x \jf 


dx 

+ ~ — ~dif 


i eYY + 


8x v 

dx'* 


dx' 


-(-iei v ) = 


where the anti-commutativity of Grassmann numbers has been used to obtain the 
last equality. These transformation rules show that the SUSY transformation is 
covariant under the coordinate transformation x 1 ' -> x ,,L . 

The supercharge Q introduced in the previous subsection should be 
generalized on the manifold M as 


Q = i{x, iff) = i g fiv (x)x^if v . 


(12.153) 


The SUSY-invariant Lagrangian on M is constructed from the SUSY variation of 
this Q as 

SQ= \dxg liv 8x k x IJ -f v + i g flv 8x IJ 'if v + ig^x^SY 

— idxgp.v^^^Y + igp V (ieY)Y + ig fiv x ,i (-ex v ) 

= -2ie I ^gn v x^x v + ^gpvYY 

(3 Xgpv - 3 vgftX - d/jgXv) YY 
= -2ie ( + '-g^YY + ^gx p r p p V YY 


where 

r'V = {g vp (3 xgpp + dpgxp - dpgxv) 

is the Christoffel symbol associated with the Levi-Civita connection. Note the 
symmetry T X p V — T x V p. By comparing this SQ with (12.151), we read off the 
Lagrangian, 


L = ig ltt y(x)i ,i x v + ^gp V {x)Y 


di Y 
d t 


+ X k r \ K (x)r 


1 . . i 

= 2 {x ’ X} + 2 


if. 


Dir 

~Dt 


(12.154) 


Here Difr/Dt is the covariant derivative of x[r along the curve x(t). 



Exercise 12.9. Show that the SUSY variation of the Lagrangian is proportional to 
the time derivative of the supercharge, 

1 d Q 

SL=-e (12.155) 

2 dr 

The quantum version of the supercharge is 

Q ~ g^P^Y" (12.156) 

that is the Dirac operator $ on M. 

Let us define some symbols that will be employed in the next section. The 
connection one-form is 

r^ = dx x r^ v (12.157) 

while the Riemann curvature two-form is 


WX = dT'i + T^ A T° v . (12.158) 

The Riemann curvature two-form is expanded in terms of dx p A dx a to yield 

77'j, = \R% pa dx p A dx a (12.159) 

the component of which is the ordinary Riemann curvature tensor. This 
component is also written in terms of the connection V /; as 


a r'/C o t^/C _i_ r~</c 

— ill °V i i 1 ,, 1 A a n 1 //1 A vn • 


(12.160) 


12.10 Supersymmetric proof of index theorem 

The proof of the index theorem in its simplest setting will be given in the present 
section by making use of the supersymmetric quantum mechanics developed in 
the previous section. 


12.10.1 The index 

Let us consider vector bundles E± — M, E — E + © E- and let V be an elliptic 
differential operator acting as 

D : T(M, E + ) -> T(M, E~). 

It is possible, by using the fibre norm, to define the adjoint of V as 
V f : T(M, E~) -» F(M, £+). 



Assuming that D is Fredholm, the index 


Ind D — dim ker D — dim ker D ' (12.161) 


is well defined. 

Theorem 12.4. The number ind D is invariant under a ‘small’ deformation of D. 


Proof. Note, first, that DD and V' V are non-negative and, hence, it follows that 

kerD = kerD' V ker V ' — ker VV ' . 

Let {<p n } be the orthonormal set of eigensections of V'V : Y{M.E + ) -* 
T(M, E+y. 

C D ] T>)(j) n = k n <p„. 

Define \[r n = ’D(p n / *Jk~ n for k n > 0, namely e (kerD)- 1 -. Then we find that 
i fr n is an eigensection with the same eigenvalue X n , namely i fr n e (ker V ' )-*- since 

( VV^)f n — V{V' Vfn) t s/Yn = k n V(p n /^K = X n \jr n . 


Note also that { \// n } is an orthonormal eigensection. 


{ fn f m ) — 


\J ).fi 


p n \V ] V\(j) m ) = 






Thus, it follows that there is a natural isomorphism between (ker'D) ' and 
(ker'D' Note, however, that there exists no such isomorphism between kerD 
and ker D ’ . Suppose N states in ker D obtain non- vanishing eigenvalues as a 
result of a small perturbation of the operator D and dim kerD decreases by N. 
Then it follows from this observation that the same number of states must also 
leave kerD 1 . Otherwise (kerD)- 1 is no longer isomorphic to (ker V^)^. Similary, 
if dim kerD increases by N , dim kerD 1 must also increase by N to keep the 
pairing properties of (ker V) 1 - and (ker D : ) 1 ■ Therefore, ind D is invariant under 
small perturbations of D. □ 


Theorem 12.5. Let D be a Fredholm differential operator. Then its index is given 
by 

indD = Tre-^ 1 ” 0 - Tre“^ x,t (12.162) 


where f > 0 is a real constant. In fact, the index is independent of f>. 



Proof. The traces in (12.162) are over {<p n } and {i j/ n }, respectively. Let {0?} 
and { \J/9 } be orthonormal eigensections of kerP and kerP^, respectively, and 
1 < i < dimkerP and 1 < j < dimkerP 1 . Then it follows that 

Tre-^-Tre-^ 

= 'Y^{(j) n \t~ pV ' V \(p n ) - ^ (f n \e~ l ' l ' Dvi \ir n ) 

A n 7^0 A„ 7^0 

+ X>°i 

i J 

= J2 e ” M " - Wn\tn)) + E 1 “ E 1 

AnT^O i j 

— dim ker P — dim ker P ' 

= indP. 

Since the summations over i and j are independent of ft. indP thus defined is 
independent of ft . □ 

The trace that appears in theorem 12.5 is identified with the heat kernel. 
Let E = E + © E- and define a differential operator acting on E by 5 (cf 
equation (12.79)) 

i G = ( £ V q ) :E-+E. (12.163) 

Moreover, define a ‘Hamiltonian’ and a matrix T by 

H = (i g) 2 = ( f vt ) r = ( l ) . (12.164) 

Since Q thus defined is anti-Hermitian, the operator H is Hermite and non- 
negative. The index of P is rewritten in a compact form by making use of T 
as 

indP = TrTe _/iw . (12.165) 

Let M be a spin manifold, for which the second Stiefel-Whitney class 

W 2 (M ) is trivial. Accordingly, the SO (A) principal bundle over M may be lifted 
to the SPIN (A) principal bundle as 

SOOfc) -> SPIN (A'). 

I i x 
M 

Let E = A (M) be this spin bundle. Then, associated with A (M) is a Clifford 
algebra {y 1 ', y v } — 2(V' V . Let us define the chirality operator 

X2«+l = i n Y\Y2 • • • Y2n- (12.166) 
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The operator Q will be identified with the supercharge later. 



It follows from y^n+i = I that the eigenvalues of yin+ 1 are restricted to be ±1, 
which we call chirality. 

Exercise 12.10. Use the Clifford algebra to show that 

Yln + 1 = 1 {Pja> Y2n+l} — 0. 

The set of sections T (M, A) for an even k is not an irreducible representation 
of SPIN (A:) but can be decomposed into two subspaces according to the chirality 
as 

T(M, A) = T(M, A+) © T(M, A - ) (12.167) 

where \jr± e T(M, A 2 * 2 ) satisfy Y2n + 1 V / ± = We assign the fermion 

number F = 0 to sections in T(M, A + ) while F = 1 for those in T(M, A - ). 
Then the F defined in (12.164) can be written as 

T = (-1) F . (12.168) 

It is clear that the operator Q flips the chirality and hence \ Q. F| =0. 

Let Q be the Dirac operator on M and let T = Y2n+ 1 ■ In fact, it follows from 
exercise 12.11 that { Q , y 2 »+i} = 0 and /2n+l is identified with (— l) f . When T 
is diagonalized as in (12.164), the chirality eigensections are expressed as 6 

= ( *0 ) *- = ( Z ) ■ <l2169) 

It should be then clear that X> : T(M, A + ) —> T(M, A - ) and : T(M, A - ) -> 
F (M, A + ) are identified with D and I) 1 , respectively, in (12.79). Accordingly, 
the index of the Dirac operator is defined as 

ind Q = dim ker D — dim ker /) ' . (12.170) 

Physicists often call the sections in kei' I) and ker D zero modes. Then, the 
index of the Dirac operator is the difference between the number of positive and 
negative chirality zero modes. This index has a path integral expression as we see 
in the next subsection. 


12.10.2 Path integral and index theorem 


Let us consider a Dirac operator Q on a 2n -dimensional spin manifold M. We 
employ Euclidean time (t — > —if) from now on. 

Let H = (i<2) 2 = \gnvP lx p v be the Hamiltonian corresonding to Q. Then 
the index of the Dirac operator has a path integral expression 


ind Q = Tr T e~^ H = Tr(-l) f e“^ ff 


f VxVxlre~fo dtL 

J PBC 


(12.171) 


6 Note the slight abuse of notations. The symbols x/f± have been used to denote sections in T(M, S ) 
as well as those in T(M. A ± ). 



where the Lagrangian L has been introduced in (12.154), 


1 „ „ 1 

L = -g»v(x)x»x v + -g^xW 1 -^ (12.172) 

and PBC stands for the boundary condition in which the path integral is over 
functions satisfying a periodic boundary condition over [0, /!]. The factor (—\ ) F 
disappears if the anti-periodic boundary condition for the fermionic variables is 
changed into a periodic one. This can be seen from the following observation. In 
the path integral formalism, the trace with (— 1) F is (see section 1.5) 

tr(— 1 ) F e~ pH = ^](n|(-l) F e“^|n> 

n 

= J d9*d9{-9\(-l) F e~ pH \9)e~ e * e (12.173) 

where F = c'c is the Fermion number operator. By noting that 

1 9) = |0> + |1>0 (-l) f \9) = |0> - |1>0 = | - 9) 

this integral is cast into the form 

J d(9* d9{9\t~ pH \9)e~ e * e . (12.174) 

Thus, by eliminating (— \) F , we have to change the boundary condition to a 
periodic one. 

This path integral is evaluated in the rest of this section to show that it reduces 
to a topological index obtained from the Dirac A-genus. 

The SUSY transformation in Euclidean time is obtained by the replacement 
t — > — ir in (12.140) as 


Sx ^ = iei jr^ Si// 11 = — iei^. 


As was shown in the previous subsection, the index is independent of f J > and, 
hence, we may consider the limit /) 0 in computing the trace. By rescaling the 

time parameter as 1 — jis, we cast the action into the form 


f 


dr 


i „ „ i „or 

-gvLvWx^x + 2 8 ^ X ^ ~of 


<1 


= f ds 

Jo 


11 dx /; dx v 

--iVUv) — — 


1 „DiA y 

2^ (X) ^ OT 


(12.175) 


Thus, any path with x ^ 0 has an exponentially small contribution to the path 
integral in the limit /3 \ 0. Accordingly, the contributions to the path integral 
come only from paths x(t) = constant in this limit. Clearly, these paths satisfy 
the periodic boundary condition. 



The periodic boundary condition forces us to take the set of loops in 
M, which we will denote as L(M), as the configuration space of the bosonic 
coordinates. To apply the saddle point method to the evaluation of the path 
integral, we have to find the set M of the extrema of the action, namely the 
solutions of the classical Euler-Lagrange equations 


Dx^ 

-s^'-dT 

Dtp 

Dr 


1 


2 

di/f^ 

dr 


Rpv x P V L Vx p = o 


+ = o. 


(12.176) 

(12.177) 


It is instructive to outline the derivation of these equations since the anti- 
commutativity of Grassmann numbers and the symmetries of the Riemann tensor 
are fully utilized. The Euler-Lagrange equation for \l/ p is 


0 = 


d 

dr 

1 Dx/s v 

2 8pv Dr 


dL 

dij/ p 


Spv 




Dr 


8L 
dij/ p 

\s KV rx x ri P + \j t ( gpv r) 

- gK V x X Tl p f K + (dxgpv)x X V + gpvV 


By multiplying both sides by g^ p and summing over p , we have 
n ilr^ 

o = - g pp g KV k x Tl p r + g w (Sxg pv ) x x r + v 

= ^ + r+* x |> (a x §PV ) - g pp g v ,n P ] r = 2^ 

which proves (12.177). Here, use has been made of the identity 

g pp [(d\gpv) - j(dxg vp + dpg v x - 9 y g A p)] 

— g 1 P J {d^-gpv + dvgXp — dpg v x) = 

in the square brackets in the second line above. 

Let us prove the equation of motion for x /l next. We find 

dL d / dL \ 

dx p dr \ dx p J 

= ^(dpga/3)x a x fi + ^(dpg afi )ir a ^- + ^gap'lr a x X d li rP K \lr lc 

- ^ ( 8nvX V + ^gafiV 
= ~[g/-iv'x V + jidxgpv + d v gi_a - dpg v x)X V X X ] 



+ 2^-8apd^x.K dxga/3rff JC g a pdxr^]i/r i jr X 

+ ks^iyVr^r + ^r^rir 

= ~~ ^^ V ~dT "*" 2^“ |8 ^ t ^' c ~ 8aP^kY pK — 9kgapY^ K 

+ SypriA + g^Yl^rVx 1 

= -g^^- + - wk + r '^Dm x 

+ k(8yf> r L ~ dxg a p)rP K m x . 

The last term of the last line of this equation is written as 

[g y p\g YV (hgvc‘ + ^agvk - dvgla) - hgaplY^f 01 ^^ 
= ~\(hgap + dpgia - d a gxp)T^ v -ilt 0 ' \jr K i 1 
= -r ak pr^ a ir K i x 
= -gccpY^rn 1 


from which we obtain 


° = -^15r + ~ 9 ^ + Kr r L 

Dx v 1 

— ~g/iv—^ \~ — Ro/K/j.kf l/ r X . 


k ~k 


- 1 ky^rr* 


Equation (12. 176) follows by renaming dummy indices. 

Let us come back to the study of the solutions of the equations of motion 
(12.176) and (12.177). Clearly, the pair* = constant and ijr = constant is one of 
solutions. Therefore, x p : t i-> p e M is always contained in the solutions, which 
may be written as M C M. Equation (12.176) reduces to the geodesic equation 
when \jr = 0 but not necessarily so in general. When the fundamental group 
it 1 ( Af ) is non-trivial, there exist non-contractible geodesics in general. Their 
contributions to the path integral, however, vanish exponentially as exp (— c//J) 
as p l 0 and, hence, are negligible. 

Before we proceed to the proof of the index theorem, we need to explain the 
saddle point method. Let us start with a simple example. Consider the integral 


Z = 



dx 

s/2ith 


Q ~f(x)/h 


The function fix) is assumed to have only one minimum at x = xq and that 
f{x) —> 00 as x -> ± 00 . Let us consider the asymmptotic expansion of the 
integral Z when the limit h —*■ 0 is taken. Put a = xq + ~Jhy and expand fix) at 
xq. Taking fix 0 ) = 0 into account, we obtain the expansion 

f{x) = fix 0 ) + ifiy 2 /"(xo) + ^/t 3/ V / (3> (* 0 ) + ^ 2 // (4) (xo) + • • • . 



If this expansion is substituted into Z, we have 


Z=e -f^r^ - 

J — oo V2tt 

x exp ^- l -y 2 f"(x 0 ) - (V /2 y 3 / (3) (*o) + f {A \x 0 ) + • • • ) 

Let us define the moment of y by 


<y"> 


J 


dy 

V27T 


y" e ~y 2 f "(*0)/2 


/ 


dy _ v 2 


\fljz 


y z f”(x 0 )/2 


Then we hnally obtain the expansion of Z as 


Z = ■ ( exp 


/Ttai) 


-i *'/y/®(«)-i*// (4 w 


One might think that one will get terms of order Of/z 1 / 2 ) if (■ • • } is expanded. 
However, this is not the case since (y 3 ) = 0 and one has (• • • } = 1 + 0(h) in 
reality. In the proof of the following index theorem, the parameter h is replaced 
by p. The index is, however, independent of ft and we conclude that terms of 
order O(ft) vanish and, hence, we need to take only the extrema of the action and 
the second-order fluctuations thereof into account. 


Exercise 12.11. Use the previous expansion to prove the Staring formula 

n\ ~ V2 \une~ n n n (12.178) 


for n 1 . 

Let us come back to SUS YQM. We take the second-order fluctuation around 
the solutions of the classical equations of motion in evaluating Z. The principal 
contribution to the path integral comes from the solution x = xq and i// = t/t 0 . We 
employ the Riemann normal coordinate based at x = xq to make our life easier. 
This is to take a coordinate system in which the metric tensor satisfies conditions 7 

3 

giiv(x o) = j-j-g^ix o) = 0. 

Thus, we have g = det g — 1 . We define the fluctuations in this coordinate system 
as 


x»(t)=x£ + t»(t) 

= Vq + t ). 

Of course, this choice does not imply that the Riemann tensor vanishes in general. 
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Note here that dx /x = d^, d i/r M — dr] 1 *. The second-order expansion of the 
action is now written as 


5 2 = f d t 
Jo 

where we have put 


1 d£" d£" 1 „drif 1 1 - „d£ v ' 


'R'lJ.vix o) — 2 ^l ! v P a (*oW 0 * 


(12.179) 


Needless to say, the zeroth-order action 5o = 5(xo, i//o ) vanishes identically. 
Let us evaluate the index 


J 


ind Q = / V^Vrje 


*-s 2 


(12.180) 


using the second-order action 52 . Here we have taken the translational invariance 
of the path integral measure VxT>iJ/ = V^Vi], Taking the periodic boundary 
condition of f , 17 into account, their Fourier expansions are given by 


— 1 \ " tp 


lijlnint / (5 


n=—o o 
00 


?7 a 


= E « 


V? ^ 

n~— 

The fluctuation operator for § in 52 is 


fi 2n int / (5 


d 2 ~ d 

+ Tv 


dr 2 


^/iv 


d t 


while that for rj is 


d 


We have to consider the zero modes ^ and rj ^ , for which n = 0, separately in 
the following Gaussian integrals . 8 Taking these into account, we write 


ind Q 


-"In 


dt “ < 




Detpsc ( <V„ — 


dr 

. I d z d 

DetpBC I —8/iv ~2 + T^ixv(xo) — 


1/2 


- 1/2 


-"In 




=1 V2jr 


Detpsc ( + 'R. flv (xo) 


- 1/2 

(12.181) 


The integrations over and t)o 316 equivalent with those over xq and i/tq. 



where / indicates that the zero modes are omitted while M is the normalization 
factor, which takes care of the ambiguities associated with the ordering of 
Grassmann numbers. Let us evaluate this factor now. 

Since ind Q is independent of ft, we put ft = 1 for simplicity. We also 
simplify our calculation by choosing the metric to be g /lv = 8 /lv . Then the 
fermion and boson parts separate completely. The fermionic part is evaluated, 
by noting //fermion = 0, to yield 

Tr K2)i+ i= [ T^e-UVM 
JPBC 

= A ff Detp BC ((5 /t v9r)' /2 j d^o 1 • • • d^ n , 

where i fr^ is the zero mode. The determinant is evaluated as follows. First, note 
that the argument in section 1 .5 shows that the determinant is, in fact, 

Detp B c Or + co) = lim Det' ((1 — eoo) d t + co) 

£— >0 


where we have introduced the harmonic oscillator frequency co, which will be set 
to zero at the end of the calculation. The ‘partition function’ is 

tr(— 1 ) F e~P H = 2sinh(/3®/2) 

= e^ /2 Detp BC ((1 - s(o)d t +co) . (12.182) 

Therefore, the determinant in the limit co — »• 0 is 

Detp BC Or) = lim e~ p<v/2 2smh(ftco/2) = 1. (12.183) 

&>— >0 

Thus, we finally obtained 

Tr Y 2n+ 1 = M f J d^ . . . dxftl". (12.184) 

We insert 

Y 2 n+\ = i " Vo ■ ■ ■ Yo" = (2i)"V f o • • • fo" 

further in the trace. Since Tr Yi n+ \ = Tr / — 2 n , we obtain 

Tr)4 +1 = 2" = A f f j d^o . . . dxft 2n (2i) n ^ . . . i/r 2 '' = ^/(-2i) n 
which leads to 

Jf f — i" . 

Next, we evaluate the normalization factor Mb of the boson part. If we employ 
imaginary time in (1.101) to obtain (. x , l|x, 0) = (27r) -1 / 2 , we have 


j = N b 


Det 1 / 2 (— 5 /xv ,3 2 ) 


. In 

/ n , 

#t=i 


dx ,L 

\f2jt 


= (2tt)- 


. 2 n 

/n ■ 

fi=t 


dx'L 



The determinant is evaluated using the / -function regularization as in section 1 .4. 
The eigenvalue of — d 2 /dr 2 with the periodic boundary condition is — 
(2mx / ji) 2 and then 



The spectral t, -function is 



from which we find 

S'_ d 2 /dt 2(0) = 41og(/l/2jr)e 2slo ^/ 2 ^(2s) + 4e 2slo ^ 2jI ^\2s)\ s= o 
= 4Pog(/8/2w)f (0) + ?'(0)] = -2 log /l. 

Therefore, the determinant is 


DetpBc 



ex P[-^ d 2/d/ 2(0)] = ^ 2 ' 


(12.185) 


By putting /l = 1, we find Detp fic (— d 2 /dr 2 ) = 1. Thus, we have obtained the 
normalization factor 

Mb = 1. 

Putting these results together, we have shown that M — M fMb = i” • 
Accordingly, the index is expressed as 


ind Q — i' 


ft ! 




_ j \[2jt 


DetpBc ( -5^— + 'R-nv(xo) 


1 - 1/2 


(12.186) 


Let us evaluate the functional determinant in (12.186). Since the Fermi 
variables are contained only in Ti^v (xq) and this is Grassmann-even, we pretend 
this part is a commuting number for the time being. The anti-symmetry of the 
Riemann tensor implies that 77 /iu (xo) satisfies 7?. /iVJ = —TZ VIJ/ . Therefore, it is 
possible, in an even-dimensional manifold M, to block-diagonalize 72 MI , in the 
form 

/ 0 >’i \ 

-yt 0 


Mm, — 


0 

yn 


yn 

0 


(12.187) 



Let us concentrate on the first block. The operator 


v , "b 7Z[iv (*(,) 

is real and, hence, the eigenvalues are made of complex conjugate pairs. Let us 
express the determinant of this block in terms of the product of these complex 
eigenvalues. We find 


det' 


/ _! 

d t 

-y\ 


d 

dr ) 


= Det' 


dr 2 


■y\ 


n(^) 2 n 


n> 1 


= FI (>1 - (2w n/p) 2 ) 

i _ /mV 

\2nn J 


;in fiy\/2 Y 

Vi/2 ) 


(12.188) 


Now the index is expressed as 


r 2 n .(./i n 

inde=i "/n^n 5 


yj ! 2 


\[2rc ' u 1 sin/ty/2' 

X — 1 J — 1 

The product with respect to j is written as 


(12.189) 


P d ' 2 y sin (571/2 J 


1/2 


Note that any Taylor expansion with respect to 1Z terminates at finite order since 
1 Ip = 0 for p > d/ 2. 

We have evaluated the contributions of the second-order fluctuations around 
a particular pair xo, i/L) so far. Now we need to take the contributions coming 
from all the solutions to the classical equations of motion into account. We have 
noted before that the set .V) of the solutions of the equations of motion contains 
the constant solution (xo, i//o) as a subset and that the contributions from non- 
constant solutions are exponentially small as f3 4- 0. Therefore, we neglect all 
periodic solutions except for constant solutions. If we note the expansion 


x^ = xR + 


VP 


we find that the integral over xo is equivalent with that over £o/V?> namely 
dxg = d^ /VP- This argument is also applied to the Grassmannian zero mode 



and we find di//^ = \fpdrj^ . In summary, the index is now written as 


ind Q — i‘ 


r 2n 

7 n 


^-dV— 

L* V2* 0 P d ' 2 


det 


\ 1/2 
\ smfiiZ/2 I 


( mti 


(12.190) 


We make the following change of variables to erase the apparent /) -dependence 
of the index, 


Vtf = • d K = VlxP d Xo • 


Substituting 


i uv — — —Tt/i vpcr Xq Xq 


into the integrand, we obtain 


ind Q = i 


• 2n 

rj n dv o d x 

M=1 


o det 


/ IJ_I 

2 2?r 2 


Ttfivpo (xo)Xo Xo 


1/2 


sin ^ ^ ^ t*o) Xq Xo 7 


(12.191) 


This is the Atiyah-Singer index theorem for the Dirac operator. 

Let us rewrite the previous theorem in a more familiar form. Note that only 
terms of order 2 n in y in the integrand yield non-vanishing contributions upon 
integration over J~[ d . Note also that |~[ djc^ is just an ordinary volume element. 
Then define the curvature two-form 


TZpv = jRixvpa dx p A <bc a . (12.192) 

Then note that TZ/ sin 1Z is even in 1Z and, hence, the integral is non-vanishing 
only when n is even, that is only when d is a multiple of four. If this is the case, 
the factor i" takes only ± 1 and we can formally replace the integrand as 

K , ^ 

sin 1Z sinh 1Z 

The reader should verify the first few terms. Then the index is now written in the 
well-known form as 


ind Q — I det 

Jm 


/ iJ-tt \ 

2 2: x 


1/2 


1 1 

sinh 7 Z 

2 2tc 


We, moreover, define the ,4-genus. Since 7 Z is anti-symmetric, it can be block- 



diagonalized as 


/ 0 x\ 

—x\ 0 


2jt n » v ~ 


\ 

0 x n 
- X n 0 ) 


Then define the ,4 -genus of M by 


mm ) = n 

7=1 


V 2 

sinh xj /2 


(12.193) 


where the RHS is defined by its formal expansion with respect to xj . 

In summary, we have proved the Atiyah-Singer index theorem in the 
simplest setting (the spin complex). 

Theorem 12.6. (Index theorem for a spin complex) The index of a Dirac 
operator defined in M is 

ind Q = I A(M). (12.194) 

Jm 


Problems 

12.1 In the text, we dealt only with compact manifolds. The extension of the AS 
index theorem to non-compact manifolds is the Callias-Bott-Seely index theorem 
(Callias 1978, Bott and Seely 1978). Here we consider the simplest case studied 
by Hirayama (1983). Consider a pair of operators 

Id tld 

L = \W{x) V = + i W(x) 

i d.v i d.v 

where W(+cxd) = [i and W(— oo) — X. 

(a) Show that Spec' L ' L = Spec' LL * , where the prime indicates that the zero 
eigenvalues are omitted. 

(b) Show that 


z \ 

L^L + z LV+z) 


/ M \ 

2 \(p? + z) 112 (X 2 + z) 1/2 J 


J(z) = tr 
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ANOMALIES IN GAUGE FIELD THEORIES 


In particle physics, symmetry principles are some of the most important 
concepts in model building. Symmetries play crucial roles for the theory to be 
renormalizable and unitary. The Lagrangian must be chosen so that it fulfils 
the observed symmetry. Note, however, that the symmetry of the Lagrangian is 
classical. There is no warranty that symmetry of the Lagrangian may be elevated 
to a quantum symmetry, i.e., the symmetry of the effective action. If the classical 
symmetry of the Lagrangian cannot be maintained in the process of quantization, 
the theory is said to have an anomaly. There are many types of anomaly: the chiral 
anomaly, gauge anomaly, gravitational anomaly, supersymmetry anomaly and so 
on. Each adjective refers to the symmetry under consideration. In the present 
chapter we look at the geometrical and topological structures of the anomalies 
appearing in gauge theories. 

We follow closely Alvarez-Gaume (1986), Alvarez-Gaume and Ginsparg 
(1985) and Sumitani (1985). See Rennie (1990) and Bartlmann (1996) for a 
complete analysis of the subject. Mickelsson (1989) and Nash (1991) have a 
section on anomalies from a more mathematical point of view. 


13.1 Introduction 

Before we introduce topological and geometrical methods to anomalies, we give 
a brief survey of the subject here. Let \jr be a massless Dirac field in four- 
dimensional space interacting with an external gauge field A /; = Ap a T a , where 
{ T a } is the set of anti-Hermitian generators of the gauge group G which is 
compact and semisimple (SU(fV), for example). The theory is described by the 
Lagrangian 

£ = (13.1) 

The Lagrangian is invariant under the usual (local) gauge transformation 

it(x) -> g~ l f{x) A^(x) -> g~ l [Ap{x) + dp] g. (13.2) 
It also has a global symmetry, 

i jr[x) -> e ly5 "i jr(x) i^U) -> \ jr(x)e iysa 


(13.3) 



called the chiral symmetry. The chiral current j 5 derived from this symmetry is 


is = 


(13.4) 


In general, whether the symmetry of a Lagrangian is retained under quantization 
is not a trivial question. In fact, it has been shown that the chiral symmetry of C 
is destroyed at the quantum level. Adler (1969) and Bell and Jackiw (1969) have 
shown by computing the triangle diagram with an external axial current and two 
external vector currents that the naive conservation law = 0 is violated, 


Ms - 


1 


16? r 2 
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^trS^v 

€ Kkllv d K (a x ^A v + ^A X A„A 



(13.5) 


where tr is a trace over the group indices. The current /7 l which appears in (13.5) 
has no group index, and, hence, (13.5) is called the Abelian anomaly. 

It is interesting to study the behaviour of a current which carries the group 
index. Consider a Weyl fermion 1 jr which couples with an external gauge 
field. The non-Abelian gauge current of the theory also satisfies an anomalous 
conservation law which defines the non-Abelian anomaly. The action is given 
by 

C = 9 ± = i(/±y 5 ). (13.6) 

The Lagrangian has the gauge symmetry 

A,,-* g-'iA^ + d^g (13.7) 


The corresponding non-Abelian current is 


(13.8) 

It has been shown by Bardeen (1969) and Gross and Jackiw (1972) that, up to the 
one-loop level, the current is not conserved. 


CD wiT = ^ tr T a d K e KkfXV (A x d„A v + ^AxA^aJ) . 


(13.9) 


At first sight, the RHSs of (13.5) and (13.9) look very similar. However, the 
difference between the normalization and the numerical factors of 4 and 4 have 
a deep topological origin. We shall see later that the Abelian anomaly in (21 + 2) 
dimensions and the non-Abelian anomaly in 21 dimensions are closely related but 
in an unexpected manner. 



13.2 Abelian anomalies 


Henceforth, we work in an even-dimensional manifold M (dim M — m — 21) 
with a Euclidean signature. Four-dimensional results will readily be obtained by 
putting i7i — 4. We assume our system is non-chiral, namely, the gauge field 
couples to the right and the left components in the same way. Our convention is 

yrt = yt l {y /J ',y v j = 2S^ lv y m+1 = (i)V ...y m 

y m+lt = y m+l (y m+l f = +/. 

The Lie group generators { T a } satisfy 

T\ = -T a [T a , Tp] = f a p y Ty ti(T a T p ) = -\& aP . 

13.2.1 Fujikawa’s method 

Among several methods of deriving anomalies, Fujikawa’s way (Fujikawa 1979, 
1980, 1986) reveals the topological and geometrical nature of the problem most 
directly. This method is equivalent to the heat kernel proof of the relevant index 
theorem. 

Let f be a massless Dirac field interacting with an external non-Abelian 
gauge field A ;l . The effective action W[A] is given by 

e -W[A] _ J (13.10) 

where iV 7 = iy^V /t = iy M (9^ + + A^), with = jOj, la pY. a P being the 

spin connection of the background space. We compactify the space in such a 
way that the geometry (the spin connection) plays no role. For example, this 
can be achieved by compactifying M 4 to S 4 = K 4 U {oo}, for which the Dirac 
genus A(T M) is trivial; see example 12.5. If this is the case, the spin connection 
is irrelevant and may be dropped from i'f . The classical action / dv i// i )7 \j/ is 
invariant with respect to the chiral rotation, 

• m+ 1_, - - ; , ,/n+l _ 

if/ — y e y i/f i/f -> i/re y . (13.11) 

We expand \/j and \jr as 

= Y, aj fi 4r = YM\ (13.12) 

i i 

where a,- and /;, are anti-commuting Grassmann variables, 

[cii, aj) = 0 {bi,bj} — 0 {ai,bj} = 0 
and ij/j is an eigenvector of the Dirac operator 

iy fi — h f i ■ 


(13.13) 



Since iy is Hermitian, A; is real. Since M is compact, xj/, can be normalized as 

(fiWj) — J djct lrj(x)x/fj(x) = 8ij. 

Now the path integrals over if/ and i ]r are replaced by those over a, and bj. 
Consider an infinitesimal chiral transformation, 

xjs(x) — > x jf(x) + i a(x)y m+1 x/r(x) (13.14a) 

xj/(x) -> xjj(x) + i \jr(x)a(x)y m+1 . (13.14b) 

As usual, we take a = a (x ) to be x-dependent. Under this change, the classical 
action transforms as 

J dx xj/i'fxjf — >■ J dx (i Jr + ixjray m+ ^)tf/ (xlr + iay" !+1 i/ r ) 

= J dx^ri^x/r + i fdx[atr m +H^ + i,inar m+ 'W 

= j dx Vfiyy- — J dx [orxJry m+1 y^(d lx + A^xjr 
+ ^K M (3M+^)(«K m+ V)] 

= /dx W + /dx«(x)a^ +l( x) (13.15) 

where we have used the anti-commutation relations {y 1 *, y m+1 ] = 0 and 

jm+iM = 1 A(x)y /i }/ m+ V(x) (13.16) 

is the chiral current. This is the higher-dimensional analogue of defined 
previously. If (13.15) were the only change caused by (13.14), naive application 
of the Ward-Takahashi relation would imply the conservation of the axial current 
9/rJm+i = 0. In quantum theory, however, we have an additional change, namely 
the change of the path integral measure. Define the chiral-rotated fields by 


xjr' = xjr + i ay m+1 xl/ — a\ xfrj 

(13.17a) 

\[r' — xjr + ix[ray 

m+1 = E^- 

(13.17b) 

Now the measure changes as 



J ]~~[ da; dbj - 

► J n^;- 

(13.18) 


From the orthonormality of {V'v}* we find that 

a'i = (iAil(l + iay m+ 1 )ir) 

= X^ i 'K 1 + i ay m+l )tj) a j = Cijaj 
j j 


(13.19a) 



where 


Qj = {ft |(1 + i oty m+l )fj) = Sij + i a(fi \y m+l irj). (13.20) 

The measure in terms of the new variables is 

j"~J da' i = [det C,- ; -] _1 |~ [ da, = exp(— trln Cjj ) |~ | da,- 
= exp[— tr In (7 + ia{fi \y m+l fj))] ]~[ da, 
exp(— tria(i/f,- \y m+l ^kj)) |~[ da,- 

= exp - ia 'Y2,{fj\y m+X jfj)\ |~[ da,- (13.21) 

' i ' 

where the inverse of the determinant appears since a, and a' are Grassmann 
variables, see Berezin (1966). 1 As for b ,- — > b';, we have 

b'i = ^^(^1(1 + wy m+1 Mi) = J2 c jib>. (13.19b) 

j J 


The Jacobian for the change bj 1/ agrees with (13.21). Thus, the measure 
transforms under the chiral rotation (13.17) as 


da, dbi -> Yi da 'i dbj exp - 2i J dx a(x) E fl(x)y m+l 4 r n(x) 


(13.22) 


Now the effective action has two expressions: 


-W[A] _ 


= /n da,- db, exp (~J dr 

i ' 

/ n da - db\ exp ^ — J dx x/fify jf — / 


2i / d x a(x)A(x) 


where 


Mx) = i{x)y m+x i/i(x). 

i 

Since a(x) is arbitrary, we have 


(13.23) 


(13.24) 


9^ + iW = -2U(i). (13.25) 

1 See section 1.5. For example, we have f a da = f cad(ca) = 1, c e M and a being a real 
Grassmann number. This shows that d (ca) = da jc. 



Thus, naive conservation of an axial current does not hold in quantum theory. 
This non-conservation of the current j,^ +1 is called the Abelian anomaly (or 
chiral anomaly or axial anomaly). 

How is this related to the topology? Let us look at the Jacobian (13.22) 
and assume that a(x) is independent of x? The integral in (13.22) is not well 
defined and must be regularized. We introduce the Gaussian cut-off (heat kernel 
regularization) as 

J d xA(x)= J dx ^ f' i (x)y m+] fj(x)exp[-().i/M) 2 ]\M^oo 

i 

= ^](^lK m+1 exp[-(iy/M) 2 ]|^)|M^oo. (13.26) 

In (13.26), 1/M 2 corresponds to the ‘time’ parameter t in the previous chapter 
and M — * o o implies t — * e. Let |i jr/) be an eigenstate of i^ with non-vanishing 
eigenvalue X,. Among the eigenstates, there exists a state |i /q) x = y m+1 \i[r) with 
eigenvalue —X, : 

i?|tfr/> x = tfy m+l \xlfi) = -y m+1 iy\fi) 

= -^Y m+l m = -XiWi)* 

where use has been made of the anti-commutation relation {y m+1 , i)^} = 0. Since 
if is a Hermitian operator, eigenvectors which belong to different eigenvalues are 
orthogonal, hence (i/r,- |i/q} x = (i'i\y m+l \i'i) — 0. This shows that 

(fi\y m+l exp[-(iy/M) 2 ]|^> = (^|y m+1 |^)exp[-(k,'/M) 2 ] = 0. 

Thus, the contribution to the RHS of (13.26) comes only from the zero-energy 
modes. Let 1 0, /) be the zero-energy modes of i'tf , (1 < i < no). They are 
not in an irreducible representation of the spin algebra and should be classified 
according to the eigenvalue of y m+l . We write 

y m+1 |0, i)± = ±|0, i)±. (13.27) 

Then, (13.26) becomes 

j d xA(x)= J]( 1 A 1 '|K" i+1 exp[-(i'y/M) 2 ]| 1 / f ,>|M^oo 
= ^ + (0,i|0,i) + - £_<0,i|0,i>_ 

i i 

= v + — V- = indiy + (13.28) 

where v + ( i>_ ) is the number of zero-energy modes with positive (negative) 
chirality (v+ + u_ = no) and if + is defined by 

-*=U° + ■:-) 

2 We are looking at the zero-momentum Ward-Takahashi relation. 



The Atiyah-Singer index theorem now comes into the problem. 

To show that (13.28), indeed, represents an integral of the relevant Chern 
character, we first note that 

(i^) 2 = - y VV^V,, = -{8^ + \[y», V„} + 3>] 

= -V M V' i -I[ y^.y^v (13.29) 


where use has been made of the relation [V /t , V„] = T /xv . Then 
A(x) = ^(^W(x|y m+1 exp[(V 2 + i[y^,y'']T /xl) )/M 2 ]|^>|M^oo. (13.30) 

i 


Let us take m — 4 for definiteness. We introduce the plane wave basis as 

f d 4 k 

(x\ fi) = J ^Wt)WW. 

Then (13.30) becomes 

f dk f d k' , , 

i 

x y m+1 exp[(V 2 + ±[y", y v ]?^)/M 2 ](x \k)(k\fi) 

H M->o o 

/ dk 

—^try m+l exp[(-k 2 + l[y^y v W^)/M 2 ] M ^oo (13.31) 
where use has been made of the completeness property 


= (27T) 4 S 4 (k - k'). 


In (13.31), we have replaced V 2 by the symbol —k 2 since the residual terms 
containing A do not survive in the limit M — » oo. If we put k 1 ' = k^/M, 
(13.31) becomes 

/ d^ 

^-^exp(-P). 

We expand the first exponential and use 

try 5 = try 5 y^y y = 0 tr y 5 y K y^y^y v = —4e K ^^ v 


d£ exp(— k 2 ) = jt 2 



to obtain 


Mx) = 2 tr 
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-1 

32?r 2 




(13.32) 


Note that the higher-order terms in the expansion of the exponential vanish in the 
limit M -> oo. The anomalous conservation law (13.25) now becomes 


B^ = ^tr^S^v 

= ^ tr[e K ^ v d K (Aid„A v + fAx-A^)]. (13.33) 

This is regarded as a local version of the AS index theorem. Let us write (13.33) 
in terms of the field strength T = jT /xu dx M A dx v . We easily verify that 


v + — V- 




ch 2 (T). 


(13.34) 


This is the index theorem for a twisted spinor complex with trivial background 
geometry (A (TM) — 1). 

For dim M = in = 21, we have the following identity: 

u+ _ = L At 3 '‘ i, " + ' = L ct “ m = L s tt (§) ■ <i3 - 35) 


13.3 Non-Abelian anomalies 


In the last section we considered the chiral current which is a gauge singlet (no 
gauge indices). Now we turn to the study of the gauge current j ,l a where a is the 
gauge index. Here we consider a chiral theory in which the gauge held A couples 
only to the left-handed Weyl fermion i jr. Suppose i// transforms in a complex 
representation r of the gauge group G. For example, suppose i jr belongs to a 3 of 
SU(3). The effective action W r [.A] is given by 


e -w r [A] 


/ 


Vifr'Di jr exp 


— J chn/fiy H 


ifr 


where 

i^+ = iy fl (d tl +A»)9 + !P± = i(l ± y m+l ). 


The gauge current is 

Fa = i iry^Ta'J’+ir. 


(13.36) 


(13.37) 

(13.38) 



Let v = v a T a be an infinitesimal gauge transformation parameter, g — 1 — v 
under which we have 


A u 


(1 + V)(Au + d)(l — V) — An 




(13.39) 


where D^v = d^v + [A IL , u] is the covariant derivative for a field in the adjoint 
representation. The effective action transforms as 

W r [A\ -> W r [A-T>v] 


dv tr ( r Dv—W l [A] 

O i/i 


/ 

I dx tr(3 /i ii“ + f afiy A/vy)j^W r [A] 

/ 


Since 


we obtain 


= W r [A] 
= W r [A ] 


= W r [A]+ I dx tr ( v T>——W r [A], 

oA 


: W r [A] = (ijry^T, U(1 + y m+l )f) A = (j\) 


(13.40) 


8A,. 


W r [A - Du] - W r [A] = f dx tr(t>“l DuU 1 *)*)- (13-41) 

We are naively tempted to regard (13.36) as det(i^ ) = ]~[ a' , Xj being the 
‘eigenvalue’ of iV 7 . A subtlety arises here: i^+ maps sections of .S’ + <Ei E to those 
of S- <8> E, where E is the vector bundle associated with the G bundle and S± 
are spin bundles with chirality ±. Accordingly, the equation iy + i fr — Xxj/ is 
meaningless. To avoid this difficulty, we formally introduce a Dirac spinor \ji and 
define 

e -w r [A] = J Vx/sV^r exp (^- J dxx fiDxjr\ (13.42) 


where i D is defined by 


i D = iy M (3/ t + iA /i 5 > +) 


i) 


0 


(13.43) 


where we have diagonalized y m+l . In (13.43), the gauge field A couples only 
to the positive chirality field. Now the eigenvalue problem iDi//, = X, r//,- is well 
defined. Note that i D is not Hermitian and A.,- is a complex number in general. 
Moreover, we need to introduce right and left eigenfunctions separately by 


i Dx/fj = Xixfrj 
X~ i (i D) = a ( x + , 


(13.44a) 


(i Dfxi = } -i Xi ■ 


(13.44b) 



(13.45) 


Since f X; ''frj dx = 0 for i ^ j, we may choose an orthonormal basis, 

J x t iV f ; dx = 8jj. 

It should be noted that the eigenvalue Xj is not gauge invariant. This follows from 
the observation that 


gdDiA^g- 1 = giy^A + g~ l (Ai + A)8A]g~ l 

= iD(A) - i hg~ l + i fag~ l y+ ± mA). (13.46) 

If the equality were to hold in (13.46), g~ l in would satisfy iD(A 8 )g~ l A — 
Xjg~ l xl/i when iD{A)4ri — Then Spec \D(A) would be gauge invariant. 

Although individual eigenvalues are not gauge invariant, the absolute value of the 
product of eigenvalues of i D is gauge invariant. In fact, 


det(iZ)) det((iZ)) t ) = det(iZ)(iZ))^) 

(i5f_)(iSf+) 0 

o (iy+)(iy_) 

= det(i0_i^ + ) det(iy + iy_) 


= det 


(13.47) 


where i$_|_ = (i0_)t and iy_ = (iy+)'. This is simply the Dirac determinant 
(up to an irrelevant factor det(i0_i^+)), 


[det(iy)] 2 = det ( ll7 -^+ , y+ ° iy _ ^ = [det(iy+iy_)] 2 (13.48) 

where iy is given by 

» = (iy° + "o). <1349) 

The Dirac determinant is gauge invariant, hence so is | det(iD) | . It then follows 
that Re W r [.A] is gauge invariant since 

exp(— W r [A]) exp(— W r [A\) — det(iD) det((iD)^) oc det(iy + iy_) 


is gauge invariant. Therefore, only the imaginary part of W r [A], that is the phase 
of det(iD), may gain an anomalous variation under gauge transformations. 

The anomaly may be computed by evaluating the Jacobian as before. The 
functional measure is taken to be ]~[ ( do, d b-,. We consider an infinitesimal gauge 
transformation, 

A — > A — Dv ij/ -> y + in/r+ tjr \ Jr — ijr-v (13.50) 


where the gauge transformation rotates the positive chirality parts only. The 
Jacobian factor is 

j dx tru(x) y^(n|x)y m+1 (x|w) 

•' n 


(13.51) 



where (xjn) = i//„(x) and (n \x) = /„(x) (note that (n| is nor the Hermitian 
conjugate of |n>). This integral is ill defined and must be regularized. As before, 
we employ the Gaussian regulator, 

f dx lim tru(;c) (n\y)y m+l (x\eT [ '^ ] / M \n) 

J M- s-oo z — ' 

x — >y 


-f 


dx lim trv(x)y m+i e 

M—>oo 
x^-y 


where use has been made of the completeness relation 

Y l«)(«l = L 


<5(x - y) (13.52) 


(13.53) 


It follows from (13.41) and (13.52) that 
5 


/ 


dx v ar D h 


SA, 


W r [A] \ = f dx lim tr[uy m+1 e“ (i6 * )2/M2 <5(x - y)]. 
J J M—*oo 
x^-y 


(13.54) 

In the present case W r really changes under (13.50). The trace may be written as 


tr[uy m+1 e“ (i ^ )2/M2 ] = tr[u((P + - ?_)e“ (i My+)-(iy-isi+)/M 2 -| 


= tr[u/’ + e 


my mm a 


] — tr[uT’_e 


mm/M- 


]. (13.55) 


(13.55) can be evaluated in the plane wave basis, which is straightforward but 
tedious (see Gross and Jackiw (1972), for example). We derive the non-Abelian 
anomaly from a topological viewpoint in the next section. For m — 4, the 
anomalous variation is 

W r [A - Vv] - W r [A] = J dx v a D lx {j^) a 

= 2^5 / dx \i{v a T a e Kkllv d K [A^A v + \AxA„A v ]} 

= / tr{ljd[ - /LdA + (13.56) 

The anomalous divergence of the gauge current is 

= 24^2 tr{7 ae^WxdnAv + \A X A„A V ]}. (13.57) 

This should be compared with (13.33). There are two differences between these 
results: the two-thirds in front of A 3 is replaced by a half and the overall factor is 
different. 



13.4 The Wess-Zumino consistency conditions 

13.4.1 The Becchi-Rouet-Stora operator and the Faddeev-Popov ghost 

Let W[A] be the effective action of the Weyl fermion in the complex 
representation r of the gauge group G. 3 In the previous section, we observed that 
the change of W[A] under an infinitesimal gauge transformation & V A — — Dv is 
given by 


S V W[A] = -J (T>^v) a j^W[A] = J (13.58) 

Following Stora (1984) and Zumino (1985) we introduce the BRS operator 
§ and the Faddeev-Popov ghost a>. Let Q"'(G) be the set of maps from S m to G. 4 
In addition to the ordinary exterior derivative d, we introduce another exterior 
derivative § on Q"‘ ( G ) which we call the Becchi-Rouet-Stora (BRS) operator. 
In general, § is defined on an infinite-dimensional space but we may also consider 
the restriction of § to a finite-dimensional compact subspace of Q m (G), such as 
S n , parametrized by X a . Then S may be written as S = dX a d/dX a . We require 
that d and § be anti-derivatives. 


d 2 = S 2 = dS + Sd = 0. (13.59) 

If we define A = d + S, A is clearly nilpotent, 

A 2 = d 2 + dS + Sd + S 2 = 0. (13.60) 

Under the action of g — g(x, X a ), A transforms as 

A-> A = g~ l (A + d)g. (13.61) 

Note that A is independent of X while A depends on X through g. Define the 

Faddeev-Popov (FP) ghost by 


a> = g l Sg. 


(13.62) 


The actions of § on A and w are found to be 


SA =S[g l (A + d)g] = —g l SgA-g ^Sg + g x S(dg) 

= -coA- (A- g~ l dg)cw - g~ l d(Sg) 

= — coA — Aw — dco = — 'Dam (13.63a) 

Sw^-g-'Sgg-igg^ -or. (13.63b) 


3 We drop the representation index r to simplify the expression. 

4 The set Q m (G ) should not be confused with the set of m-forms on M. The distinction 

should be clear from the context. 



It is easy to verify that S is nilpotent on A and w and, hence, on any polynomial 
of A and a> as it should be; see exercise 13.1. Define the field strength of A by 

F = dA + A 2 = g~ l 3'g. (13.64) 

We also define 

A = g -1 (.A + A)g = A + g~ l Sg — A + w (13.65a) 

F= AA + A 2 = g~ l 3g = F (13.65b) 

where (13.65b) follows since 9“ — d A + A 2 = A A + A 2 (note that §A = 0). It 
is found from theorem 10.1 that A is an Ehresmann connection on the principal 
bundle and F its associated curvature two-form. 

The existence of a non-Abelian anomaly implies that W[A] does not vanish 
under the action of the BRS operator § (a> roughly corresponds to v; see (13.39) 
and (13.63a)), 

S W[A] = G[co,A]. (13.66) 

Since W[A] is independent of co, S acts through A only. Before we write down 
the Wess-Zumino consistency condition for the non-Abelian anomaly, we stop 
here and consider the physical meaning of the BRS operator and the FP ghost. 

Exercise 13.1. Verify from (13.63) that the actions of S on A and co are nilpotent, 

S 2 A = 0 § 2 co = 0. (13.67) 

13.4.2 The BRS operator, FP ghost and moduli space 

To find the physical meaning of S and ok we need to examine the topology of 
the gauge fields (Atiyah and lones 1978, Singer 1985, Sumitani 1985). Let 21 
be the space of all gauge potential configurations on S m . For definiteness, we 
take m — 4 but the generalization to arbitrary m is obvious. The topology 
of 21 is trivial since, for any gauge potential configurations A\ and .4.2, the 
combination tA\ + (1 — t)Ai (0 < t < 1) is again a gauge potential on S 4 . 
Note, however, that 21 does not describe the physical configuration space of the 
gauge theory. We have to identify those field configurations which are connected 
by G-gauge transformations. Let 0 be the space of all gauge transformations on 
S 4 (0 = Q 4 (G) in our previous notation). Then the physical configuration space 
must be identified with 21/0, called the moduli space of the gauge theory. We 
have seen in section 10.5 that the gauge field configuration on S 4 is classified by 
the transition function g : S 3 -> G, S 3 being the equator of .S' 4 . In the present 
case, 21/0 is classified by the transition function on the equator S 3 -» G and, 
hence, 

21/0 ~ ^ 3 (G). (13.68) 

Thus, each connected component of 21/0 is labelled by the instanton number k. 
This component is denoted by L? 4 (G). 




a/e 


Figure 13.1. The BRS operator S is the restriction of 8 along the fibre. 

We note that the space 21 has a natural projection n : 21 — > 21/0 and can 
be made into a fibre bundle whose fibre is 0, see figure 13.1. Let a e 21 be a 
representative of the class [a] e 21/0 and let 

A(x) = g~ 1 (x)(a(x ) + d)g(.r) (13.69) 

be an element of 21 in [a]. We denote the exterior derivative operator in 21 by 
8, which is a functional variation and should not be confused with the usual 
derivative d; see Leinaas and Olaussen (1982). If 8 is applied on (13.69), we 
find that 


SA = -g l 8gA + g l 8ag-g l a8g - g J d(Sg) 

= g~ l Sag - d (g _1 <5g) - g~ l 8gA - Ag^.&g 
= g- l 8ag-‘B A (g- 1 8g) (13.70) 

where T) A = d + [A, ]. The first term of (13.70) represents the derivative of A 

along 21/0 while the second represents that along the fibre; see figure 13.1. The 
BRS transformation S is obtained by restricting the variation 8 along the fibre, 

SA = <5-A|fibre — ~ (13.71a) 

where the FP ghost co is g -1 Sg = fibre' a ^ so t ^ lat 

§ CO = <5tt>|fibre = -g _1 Sgg“'§g = -a? (13.71b) 


which reproduces (13.63a). 



13.4.3 The Wess-Zumino conditions 

Exercise 13.1 shows that § is nilpotent on any polynomial / of A and co, 

§ 2 f(co, A) = 0. (13.72) 

The nilpotency is required by the interpretation of § as an exterior derivative 
operator. In particular, we should have 

SG[&>, A] = S 2 W[A] = 0. (13.73) 

This condition is called the Wess-Zumino consistency condition (WZ 
condition) and can be used to determine the non-Abelian anomaly (Wess and 
Zumino 1971, Stora 1984, Zumino 1985, Zumino etal 1984). If the anomaly G is 
mathematically well defined, G should satisfy the WZ condition. This condition 
is so strong that once the first term of G[a>. A] is given, the anomaly is completely 
pinned down. 

13.4.4 Descent equations and solutions of WZ conditions 

Stora (1984) and Zumino (1985) constructed the solution of WZ conditions as 
follows. The Abelian anomaly in (21 + 2)-dimensional space is given by 

1 (\ F\ I+1 

ch '+ l(f) =(7TT y:i^J (13 - 74) 

where F — dA + A 2 , A = g _1 (A + d)g as before. Let Q 2 I+i(A, F ) be the 
Chern-Simons form of ch;+i (F), 

ch, +l (F) = dQ v +i(A, F). (13.75) 

Since the algebraic structure of the triplet (A, A, F) is exactly the same as that of 
(d. A, F ), we also have 

ch/+i(F) = A<22 /+i(A, F) = AQ 2 i+i(A + co, F) (13.76) 

where we have noted that A = A + co and F = F . If we expand Q 2 i+ l (A, F) = 
Q21+1 (A + co , F ) in powers of o>, we have 

02 /+ 1 (A, F) = Q° v+l (A, F) + Q l v (co, A, F) + Qj^ico, A, F) 

H + Qo +1 (v, A, F) (13.77) 

where Q s r is ^th order in co and r + s = 21 + 1 . 

We now note that ch/+i (IF) = ch/+i (F) since F = F = g~ 1 3 r g. In terms of 
the Chern-Simons forms, this can be expressed as 


A02/+i(A, F) = d02/+i0f, F). 


(13.78) 



Substituting (13.77) into (13.78), we have 

(d + §)[Q° 2l+1 (A,F)+Q l 2l (co,A,F) 

+ • • • + Go ' + V A, F)] = dG? /+1 (A, F). (13.79) 

If we collect terms of the same order in to, we have the ‘descent equations’ 


§Q° 2l+l (A,F) + dQ 1 2l (co,A,F) = 0 

(13.80a) 

§Q\,(a), A, F) + d Q 2 2l _ x (co, A, F) = 0 

(13.80b) 

§ Qf (to. A, F) + dQl l+1 (co, A, F) = 0 

(13.80c) 

S02'+1( w ,a,F) = O. 

(13.80d) 


Note here that S increases the degree of w by one, see (13.63). Let us look at the 
2/ -form Q\,(o), A, F). If we put 


G[co, A, F]= f Q l 2l (co,A,F) (13.81) 

Jm 

G[a>, A, F] satisfies the WZ condition, 

SG[«,A,F]= [ SQ^io), A, F) = - f dQh^ico, A, F) 

Jm Jm 

= ~ f Q^ l _ l (o),A,F) = 0 
JSM 

where we have assumed that M has no boundary and use has been made 
of (13.80b). This shows that once Q\,(co,A,F) is obtained, the anomaly 
G[ft>, A, F] is easily found. 

Proposition 13.1. Q\, defined here is given by 

/ j \ l + 1 i pi 

Q\l(co, A , T) = (^— j — — St( 1 - t) strMfAT;- 1 )]. (13.82) 

[Note: In the proof, we tentatively drop the normalization factor (i/2jr ) /_l_1 to 
simplify the expressions. This factor will be recovered at the very end.] 

Proof. We start with (11 .105), 

| d 

Q 21+1 (A + ft>, 3") = — / St tr[(A + <u)3" ? ] 

<1 Jo 



where 


£F f = t?+ (t 2 -t)(A + co) 2 

= 3 f ,+ (r - t)[A, a)} + (t 2 - t)or 
3, = &{tA) + {tA) 2 . 

If we substitute 3) into Q 21+1 and collect terms of first order in co, we have: 

1 f 1 

- / St tr[ w? 1 , + ( t 2 - t)(A[A, co]T t ~ l + A?, [A, co]^'~ 2 

I- Jo 

H h co])] 

= i J St strtwj' + ( t 2 - t)A(3\- l [A, co]] 

+ ?',- 2 [A,coW r + ---)] 

= J]J St str[ co!? 1 , + ( t 2 - t)lA[A, ujj' -1 ] 

= J]J St str[«3'' + /(r 2 - t){[A, A]co?'~ l + Aco[A, Jj -1 ])] 

= i J St str[w{3> + Kt - l)(t[A, A]?'~ l - A[A t , S'' -1 ])}] 

where str is the symmetrized trace defined by (1 1.8). Now we use 
T> t ?\~ l = dS^ -1 + [At, S'' -1 ] = 0 
35T 

-J- — dA + t[A, A] 

at 

to change the final line of the previous equation to 

^ J Ststr^co^l+IQ - 1) (^ - dA^j S' 1 ,- 1 + Ad?'- 1 j 

1 r r gy/ ] ' 

= — St str « j; + /(i-t)d(yi3';- 1 ) + (r- i)-^M . 

Integrating by parts, we find that 

Q\M a , S') = J St( 1 - t ) sh-MC/lfF;- 1 )]. 

If we recover the normalization, we finally have 
/ i \' +1 1 

Q\ l {co,A,?) = I — j — — J St(l-t)str[cod(A^- 1 )]. □ 



For m = 21 — 2 and m = 4, we have 



2 


Q\{co,A,F) = \ 
6 



tr(®d A) 


3 


str(&>d(AdA + ^A 3 )). 


(13.83b) 


(13.83a) 


These results are also verified by direct computations. Up to the normalization 
factor, (13.83b) yields the non- Abelian anomaly in four-dimensional space; see 
(13.56). 

Sumitani (1984) pointed out that the approach to the non- Abelian anomalies 
here is ad hoc and does not clarify the following points: 

(1) The WZ condition (13.73) does not fix the normalization of the anomaly and, 
moreover, the uniqueness of the solution is far from trivial. 

(2) It is not clear why we should start from the Abelian anomaly in (m + 2)- 
dimensional space. 

To answer these questions we need to develop a more elaborate index 
theorem called the family index theorem; see Atiyah and Singer (1984), Singer 
(1985) and Sumitani (1984, 1985). In the next section, we outline the physicists’ 
approach to this problem, closely following the work of Alvarez-Gaume and 
Ginsparg (1984). 

13.5 Abelian anomalies versus non- Abelian anomalies 

Let us consider an m -dimensional Euclidean space (in = 21) which is 

compactified to 5™ = W U {oo} and let G be a semisimple gauge group which 
is simply connected (like SU(A) for which 7ri(SU(lV)) is trivial). Consider a 
one-parameter family of gauge transformations g(0,x) (0 < 0 < 2n) such that 


g(0, x) = g( 2it,x) = e. 


(13.84) 


Without loss of generality, we may normalize g so that g(0, xq) = e at a point 
xq e S m . The map g : 5 1 x S'" —> G is classified according to the homotopy class 
7T m +i (G). To see this we define the smash product X A Y of topological spaces 
X and Y by the direct product X x Y with X V Y = (jco x X) U (Z x yo) shrunk 
to a point. From figure 13.2, we easily find that S 1 A S' n = S m A S l = S™ +1 . 5 
Repeated applications of this yield 


S' n A S n = 


(13.85) 


In the case which interests us, the conditions (13.84) make the direct product 
.S’ 1 x S'" look topologically like .S' 1 A S'" = S '" + 1 . Thus, g is regarded as a map 


5 The readers may convince themselves by explicitly drawing sUs 1 =S". 


Figure 13.2. The smash product S 1 A S m ~ 5"' +1 


from 5' m+1 to G and is classified by jr m +i (G). Since we have a one-parameter 
family in the space 0 = Q'" (G), we also have n m+ \ (G) = tt\ (0). In practice, 
we take G = SU(1V) for which we have 

7r m+ i(SU(Af)) = Z N>\m + \. (13.86) 

Now we take a ‘reference’ gauge field A in the zero instanton sector <TJ"(G) for 
which we may assume, without loss of generality, that the Dirac operator (13.49) 
has no zero modes. Consider a one -parameter family of gauge potentials 

A 8(e) (x) = g-\9,x)(A(x) + d)g(e,x ) (13.87) 

where 9 parametrizes 5 1 . In section 13.3, we observed that deti/J is gauge 
invariant (see (13.47)) and only the phase of det i D may gain an anomalous 
variation under a gauge transformation. This, in particular, implies that det i I) 
does not vanish for any 9. We write 

exp{-W,[.A g(e) ]} = detiD(A 8(e) ) = [det i^ (.A)] 1/2 exp[iiu(^l, 9)} (13.88) 

where i^ is the Dirac operator (13.49) and exp[iw(A, 9)] is the anomalous phase 
associated with the gauge transformation (13.87). Next we consider a two- 
parameter family of gauge fields A t,e (0 < t < 1) which interpolates between 
A = Oand A g(e \ 

A tfi =tA 8(e) (0 < f < 1). (13.89) 

The parameter space specified by (t , 9) is considered to be a two-dimensional unit 
disc D 2 with polar coordinates (t , 0). On the boundary of the disc, 3 D 2 = S 1 , 
the modulus of det i/3(.A l/; ) is a non-vanishing constant. The phase Q lw (A,d) now 
defines a map S l (=d D 2 ) S l (=U(1)); see figure 13.3. As we move around 



Figure 13.3. The phase of the effective action W[A S ^^] defines a map S 1 — >■ U(l) by 
6 c lw (A,0). Q n t jj e ,jj SCi there are points { pi } at which detiD(yi r ' 0 ) vanishes. The 

winding number of the map S 1 — >• U(l) is obtained by summing a winding number along 

Q. 


the boundary of the disc, the phase winds around the unit circle. The winding 
number of this map is an integer 


1 r 2n 

Ji= — / 

27 r Jo 


dw(A, 6 ) 

W 


d<9. 


(13.90) 


We find below that J\f is derived from the Abelian anomaly in (m + 2) dimensions. 


Exercise 13.2. Show that 

W[A g(2,T) ] - W[A‘ ?(0) ] = -2jtm. (13.91) 

Since g(2n) = g(0), (13.91) may be regarded as a Berry phase. 


13.5.1 m dimensions versus m + 2 dimensions 

We recall that our reference gauge field A supports no zero modes of the operator 
iD(A). Since | detiZl(yi s(0 ' ) )| = |detiD(yi)| ^ 0, the operator iD(A 8(e ^) does 
not admit zero modes either. Of course, \D{A'' d ) may have zero modes since 
A 1 ' 6 is not obtained from A by a gauge transformation in general. Suppose it 
has a zero mode at p, = ( r, , 0 , ) . We assume they are isolated points. Since 
de\.\D(A’' e ) is a regularized product of eigenvalues, it vanishes at /?, . The 
phase of &etiD(A' ,e ) may be homotopically non-trivial only around these points. 
Moreover, the winding number at p, is determined by the eigenvalue which 
vanishes at p,. For example, if X n (t , 0) vanishes at pi it should be of the form 

i Wi(t,0) 


X„(t, 6) = f{t, 0)e' 


(13.92) 




s = 1 


/= 1 


Figure 13.4. 


where /(?,-, 0,) = 0. The winding number at p is 

1 f d 

mi = — / —Wi(t, 9)ds 
2 it J Ci d^ 


(13.93) 


where C , is a small contour surrounding /?,, see figure 13.3. Continuously 
deforming the loop .S' 1 = 3 D 2 into a sum of small circles C, enclosing p\, we 
find that the total winding number is 



(13.94) 


Now we show that the winding number N is related to the index theorem 
in {in + 2)-dimensional space {m = 21): N = indi )^ m +2 where 1 $ m +2 is 
the Dirac operator on S 2 x S'" defined later. Let us consider a gauge theory 
defined on D 2 x S'" whose coordinates are (t, 0, x). To avoid the boundary term, 
we add another piece, D 2 x S'" , with coordinates ( s , 0, r), to form a manifold 
S 2 x S'" without a boundary; see figure 13.4. We call the patch (f , 0) the northern 
hemisphere U\\ and ( s , 0) the southern hemisphere Us- On the equator .S 1 of S 2 , 
we have t = s = 1 . We choose the following local gauge potentials 


A N (t,9,x) =A*’ e +g 'd e g 
yis(i, 9, x) — A 


( t , 0) e f/ N 
(j, 0) e U S 


(13.95a) 

(13.95b) 


where A is the reference gauge field introduced previously. To elevate ./In = 
yiN/i dx IJ and j4s = /Is/x dx M to the globally defined connection on the G bundle 
over S 2 x S'" we define the (m + 2) -dimensional gauge potentials 


An(L 9, x) = (A,,A e ,A^) = (0, 0, 4n m ) 
As ( 5 , 9, x) = (As^AotAh) = (0, 0, Asij,)- 


(13.96a) 

(13.96b) 


On the equator (t = s = 1), we have An = g~ l (As + A )g, where A = d+dg +d f 
(note that d,g — 0). Thus, A = {An , As } defines a global connection on S 2 x S m . 
Consider a Dirac operator Vf m +2 which couples to A. The index theorem for 
iy „ )+ 2 is given by 

indiy m+2 = N+ - N_ = [ ch,+i(F) (13.97) 

J S 2 x S"‘ 

where F = AA + A 2 and 3\f+ (Af_) is the number of + (— ) chirality zero modes 
of iy m+2 (chirality is defined in an (m + 2)-space). 

Alvarez-Gaume and Ginsparg (1984) have shown, using an adiabatic 
perturbative computation, that each winding number m,- must be ±1. Moreover, 
the Dirac operator iy m +2 has a zero mode at p, — ( f,,0, ) with (m + 2)- 
dimensional chirality x — w; = ±1. Then the total winding number Af = ^ m, 
is given by the index Af+ — Af_. Now we have 

r \ r 2n dw(A,0) 

indiy m+2 =/ ch /+1 (F) = - — / d0 — — . (13.98) 

JS 2 xS m 27 T Jo ov 

We easily find the non- Abelian anomaly from (13.98) including the 
normalization. Since ch/ + i (IF) = dQ m+ i (A, F), we have 


f ch /+ i(F) = 
JS 2 xS m 


ch/_|_i (Fn ) + 


ch/+i(F s ) 


-L 


[G™+i(AN,F N )| t =i - Got+i(As,Fs)|j=i]. 


(13.99) 


From (11.1 18), we find that 

<2m+lC%, Fn )| ; =1 - <2m + l(As, Fs)| s= l 
= Qm+\(g~ { ^g, 0) + A a m 

/ i /i 

= (-!)'(-) + (13.100) 

The index theorem is now given by 

indiy w+2 = (-l) / f-^-) + - l \ — f tr(g- l Ag) m+l . (13.101) 

\2ttJ (in + 1 )! J s l xS m 

Theorem 10.7 states that f s3 tr(g _l dg) 3 yields the winding number of the map 
g : S 3 -> SU(2). In the same manner, (13.101) represents the winding number of 
the map g : S m+ 1 -> G and is classified by n m +i(G) (note that S l A S m = S m+1 ). 

Finally, we show that the non-Abelian anomaly should be identified with 
Q) n . We first note that 



<2m+i(As, Fs) = 0 



since the integrand is independent of d9 and, thus, cannot be a volume element of 
.S’ 1 x S m . Then we have 


indiy m+2 = [ Q m+] (A g(d) + oj,+ s<0) ) (13.102) 

JS 1 xS m 

where w — g~ l d eg and = d + (A ?(0) ) 2 = g(9)~ l, J g(0). 

If the integrand in (13.102) is expanded in &>, only the term linear in dd 
contributes to the integral. This term Qj n (a>, A 8(d) , is proportional to 

d 9 A (volume element in S m ) and, hence, is a volume element of .S' 1 x S™. We 
now have 


&wW[ A] 



tr ojT> ia 


SW[A ] 

&An 


id 0 w{9 , A) = 2ni f Q) n (w , A gm , 3 gm ). 
Js m 


(13.103) 


The explicit form of Q) n is given by (13.82). For m = 4, we find that 
SW[A] 


J 


trail),, 




— 2jti f Ql(co,A 8{e) ,^ W) ) 

Js A 

= -L f troid dA g(0) + V ?W ) 3 
24n 2 Js A 2 V 


Putting 9 — 0 (g — e), we reproduce the anomalous divergence 




1 


247T 2 


tr T„€ 


kX/av ‘ 


1 

A\ d/iA v -f* — A\A^A V 


(13.104) 


(13.105) 


which is in agreement with (13.56). The present method guarantees that the 
WZ condition yields the correct result. Moreover, it reproduces the anomalous 
divergence including the normalization which cannot be fixed by the WZ 
condition alone. 


13.6 The parity anomaly in odd-dimensional spaces 

So far, we have been working in even-dimensional spaces. One of the reasons 
for this is that SO(2/ + 1 ) has real or pseudo-real spinor representations but no 
complex representations, hence no gauge anomaly is expected. However, we 
can show that gauge theories in odd-dimensional spaces have a different kind 
of anomaly called the ‘parity anomaly’, in which the parity symmetry of the 
classical action is not maintained through quantization. It should be noted that 
the parity anomaly in 2/ + 1 dimensions is related to the Abelian anomaly in 
21 + 2 dimensions as was pointed out by Alvarez-Gaume el al (1985). 



13.6.1 The parity anomaly 


Let M be a (21 + 1) -dimensional Riemannian manifold. We distinguish one 
dimension from the others; that is we assume that M is of the form Mx Jvt or 
■S 1 x 3Vt, where JVt is a 2/-dimensional compact manifold without a boundary. We 
denote the coordinate of M or S l by t while that of M is denoted by x. The index 
0 denotes the component in r-space while /i denotes that in x-space. For example, 
the components of the y -matrices are {y°,y^ (1 < jj. < 21)}. 

Define the ‘parity’ operation P by 

Ao(t, x) -* 7l p (r, x) — —Ao(—t, x) 

A„(r,x) -> 7l p (r,x) — A/, (—t, x) 
ir(t, x) Vf P (L x) — iyo%[r(—t, x) 
i ]r(t, x) —>■ Vf P (L x) = i\jf(—t, x)yo . 

The classical action is invariant under the parity operation, 

/ dtdriAi^-* - / dr dx i/r (— t, x)y°i[y°(do — Ao(—t, x)) 

+ y #4 (3ju. + A^i-t, x))]y°f(-t, x) 

= J dr dx i j/(t, x)i[y°(3o + Ao(t, x)) 

+ + -A-nit, x))]f(t, x) 

where we put t — » — t in the final line. Let us see whether this invariance is 
observed by the effective action. The effective action is given by the regularized 
product of the eigenvalues of iy . We employ the Pauli-Villars regularization to 
regulate the product, that is 


£reg = xiy* +iMxx 


(13.106) 


is added to the original Lagrangian. The Pauli-Villars regulator / is a spinor 
which obeys bosonic statistics and the limit M -* oo is understood. The 
regularized determinant is 


e -W[.A] 


detiy 

det(iy + i M) 


n 


A, 

A.; -f i M 


(13.107) 


where we noted that x is bosonic. Here A.,- is the rth eigenvalue of iy; i y if/j — 
XjiJ/j. Under the parity operation, eigenvalues change sign, 

i[K°Oo - A 0 (-t,x)) + y l (di + Aj(-t, x))]iy°\lfi(-t, x) 

= iy 0 [K 0 (-3r -Aq(t,x)) - y'(dj + Ai(r, x))]ii/r(r, x) 

= -Xiiy 0 ^i(r,x) 



where r = — t. This shows that the effective action IT [ A | transforms under the 
parity operation P as 


W[A] -* W[A P ] = -In FT — — = W[A] (13.108) 

1 1 -A + l M 

where the bar denotes complex conjugation. (13.108) shows that the imaginary 
part of W is identified with the parity-violating part 

W[A] - W[A P ] =2lmW[A]. (13.109) 

Im W[A] is given by the (/-invariant defined in section 12.8. In fact, 

ImlT[yi]= lim Im ( — In | = lim tan _ 1 (M/A,) 

M^OO V ^ A,- + \M ) M^o ' 

i i 

=I(ei-eO=^- (mio) 

A<0 ' 

Thus, the Pauli-Villars regulator gives a regularized form for the ^-invariant. We 
finally have 

Im W[A] = 1 1 , = | lim ^'sgnAilAir 2 * (13.111) 

i 

where the prime indicates the omission of zero modes. 

13.6.2 The dimensional ladder: 4-3-2 

It is remarkable that the parity anomaly (13.110) is closely related to the chiral 
anomaly in a (21 + 2) -dimensional space (Alvarez-Gaume et al 1985). Following 
Forte (1987), we look at the dimensional ladder, 

four-dimensional Abelian anomaly 
three-dimensional parity anomaly (13.112) 

I 

two-dimensional non- Abelian anomaly. 

We take M 4 = S 2 x S 2 as a four-dimensional space. The Abelian anomaly 
is given by the index 


indiy 4 = K + — [ 9,4' = [ ch 2 (F). (13.113) 

Js 2 xS 2 Js 2 xS 2 

As before, N+ (Af_) is the number of positive (negative) chirality zero modes. 
Let Q 3 be the Chern-Simons form of ch 2 (F); ch 2 (F) = d<23(A, F). Then 



N = N+ — K_ is given by 


f cho (F) = f dg 3 (A N ,F N ) + [ d(2 3 (As,Fs) 

Js 2 xS 2 Ju N xS 2 Ju s xS 2 

L 

— f 

24- JT 2 JsixS 2 


I U N xS 2 

[ Qh (An , Fn ) - dg 3 (As, Fs)] 


tr(g *d l? ) 3 


(13.114) 


where g is the gauge transformation connecting An and A§; An = g -1 (As + 
d + de)g. In the previous section, we have shown that 3\f also represents the 
non- Abelian anomaly 


N = 


~f 

2n Jo 


2 - dw(A,0) 

dO 

o 3 0 


where w is defined by 


detiD(/D (6l) ) = e iu,(yl ' e) detiDCA). 


(13.115a) 


(13.115b) 


Here A is the reference gauge potential and 

A g(e) = g-\x,Q)(A + d)g{x,0)ib = $+ 4T+. 


Next, we show that N is also related to the parity anomaly in three- 
dimensional space. Let i^ 3 be a three-dimensional Dirac operator and define 
a four-dimensional Dirac operator by 

= icr i ® I + a 2 <8> iVst-A,] (13.116) 

at 

where A t is a one-parameter family of gauge potentials interpolating Aq — A t - o 
and A\ = A t =\. The Atiyah-Patodi-Singer index theorem (section 12.8) is 

indi0 4 = — f ch 2 (T) + \[r)(t = 1) — i?(i = 0)] (13.117) 

JS 2 xS l xI 

where we have noted that the Dirac genus A is trivial on .S' 2 x .S' 1 x I . Suppose 
Aq and A\ are related by a gauge transformation, 

A x =g- 1 (A 0 + d)g (13.118a) 

and consider an interpolating potential 


At = t Ax A — t )A q . 


(13.118b) 



Since the spectrum of iV ' 3 is gauge invariant, in particular Spec -\(Au) = 
Spec i^ 3 (yLj), the ^-invariant is also gauge invariant. Then rj(t = 0) = ?;(r = 1) 
and the APS index theorem (13.117) yields 

spectral flow = indi 04 (./l() 

= f ch 2 (T)= f Q 3 (A 0 , To)] 

JS 2 xS 2 JS'xS 2 

= f Q3(g~ 1 dg,0) = M. (13.119) 

Js'xS 2 

Thus, the spectral flow of the three-dimensional theory is given by the index X. 
In summary, the map g : S 2 x S l G is understood in three different ways: 

( 1 ) g is a transition function at the boundary of two patches of a G bundle over 
S 2 x S 2 . It yields the index X of the four-dimensional Abelian anomaly. 

(2) Suppose Ao and A\ = g - 1 (A-o + d)g are gauge potentials on S 2 x S 1 . 
The gauge transformation function g measures the spectral flow X between 
Spei^flAo) and Spe i ^ 3 (-A 1 ). 

(3) g : S 2 x 5 1 — > G induces a map S 1 -> G5, the winding number X of which 
is identified with the non-Abelian anomaly in two-dimensional space. 

Thus, we have obtained the 'dimensional ladder’ 4-3-2. The extension to higher 
dimensions is obvious. 


6 


Note that there is no gauge anomaly in odd-dimensional spaces. 
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BOSONIC STRING THEORY 


In the present chapter, we study the one-loop amplitude of bosonic string 
theory. Our example is the simplest one: closed, oriented bosonic strings in 26- 
dimensional Euclidean space. 1 The action is the Polyakov action 

S=^~ f d 2 £ y /y Y aP B a Xi l dpX tl - f d 2 § y yU (14.1) 

2?r 47T Jy, s 

where Eg is a Riemann surface with genus g. The second term is proportional 
to the Euler characteristic x =2 — 2 g and, hence, determines the relative ratio 
of multi-loop amplitudes; the g-loop amplitude is proportional to exp(— Xg). We 
have not written down the possible counter terms explicitly. 

In the following sections, we work out the path integral formalism of bosonic 
strings. We first develop the necessary mathematical tools, namely differential 
geometry on Riemann surfaces. Then the path integral expression for the vacuum 
amplitude is written down. As an example, we compute the one-loop vacuum 
amplitude. Our exposition is based on D’Hoker and Phong (1986), Polchinski 
(1986) and Moore and Nelson (1986). There are many surveys of these topics, 
for example, Alvarez-Gaume and Nelson (1986), Bagger (1987), D’Hoker and 
Phong (1988) and Weinberg (1988). 

14.1 Differential geometry on Riemann surfaces 

Riemann surfaces are real two-dimensional manifolds without boundary. In our 
study of topology and geometry, we referred to them in various places. Here 
we summarize the basic facts on Riemann surfaces, which will make this chapter 
self-contained. We also introduce several new aspects of Riemann surfaces, which 
provide enough background for the study of bosonic string amplitudes. 

14.1.1 Metric and complex structure 

Let Eg be a Riemann surface of genus g. It was shown in example 7.9 that we 
may introduce, in any chart U, the isothermal coordinates (§ ! , £ 2 ) in which the 
metric is conformally flat: 

g = e 2<T ®(df 1 ® df 1 + df <g> df 2 ). (14.2) 

1 The reason for D = 26 will be clarified in section 14.2. 



Introduce the complex coordinates 


z = t' + i? 2 z = ^-i| 2 . (14.3) 


Forms and vectors are spanned by 

dz = d£ 1 +id§ 2 dz = df 1 — id£ 2 

_ i / a . a \ _ i / a .a 

z ~ 2\W ~ 1 WJ Z ~2\W + 1 W 

In terms of the complex coordinates, the metric takes the form 

g — I e 2or(J: ' z:) [dz <g> dz + dz <g> dz]. 


(14.4a) 

(14.4b) 


(14.5) 


The components of g are 

8zz = 8zz = 8zz = 8zz = 0 (14.6a) 

gzz _ gzz _ 2e _2<T = 0. (14.6b) 

Let V be another chart of E g such that U IT V / 0. Let (w, w ) be the 
complex coordinates in V. The metric in V is 

g = e 2cr '(»•“’) dw ® chb. (14.7) 

The two expressions (14.5) and (14.7) should agree on U IT V, 

e 2<r(z,z) d , 0 d - z = e 2«'(w,w) dw 0 d - 


Since 


dw <g> dw = [(dw/dz) dz + (3w/3z)dz] <g> [(dw/dz) dz + (3w/3z)dz] 
oc dz <8> dz 

we must have dw/dz — dw/dz — 0. [Another possibility, dw/dz = dw/dz = 0 
is ruled out if (z, z) and (w, w) define the same orientation.] Thus, it follows that 

w = w(z) w = w(z) (14.8) 

which verifies that E„ is a complex manifold. We also have 

e 2<x(«) = e 2cr>(w,w) |aw/az| 2 . (14.9) 


14.1.2 Vectors, forms and tensors 

Let M — Eg. The components of vector fields V z d/dz e T M + and V z d/dz € 
T M~ transform as 


v w = (dw/dz)v z 


V w = (dw/dz)V z . 


(14.10) 



The components of differential forms w z dz e Q lA> (M) and ?/>= dz e Q 11 ' 1 (M) 
transform as 


a> u , = (dw/dz) 1 co z = (dw/dz) 1 co z . (14.11) 

These are identified with sections of the holomorphic (anti-holomorphic) line 
bundles over M — Y. :< , for which the transition functions are holomorphic (anti- 
holomorphic). The metric provides a natural isomorphism between T M + and 
Q°- 1 (M) through 


= g zz y z , v z = g zz co- z . 

Similarly, T M~ is isomorphic to a) 10 (M): 

(14.12) 

<Oz = 8zzV z , V z = g zz co z . 

(14.13) 


In general, given an arbitrary tensor, the metric allows us to trade all the z-indices 
for z-indices. It is easy to see that 

1\ «2 


y Z...Z Z—Z 
Z...Z Z—Z 


q\+P2 Gl 

r tl = C 8zz) q2 (g zl ) p 2 T . (14.14) 

Pi P2 Pl+^2 Pi P2 

This correspondence is an isomorphism. For example, observe that 


T - z 

1 zz 


8 ZZ 8zzTzf = T- Z 


z V 


Thus, it is only necessary to consider tensors with pure "-indices. For these 
tensors, we assign the helicity. Since T has z-indices only, it transforms under 
z —> w as 



(14.15) 


where n € Z is given by the number of upper z-indices minus the number of lower 
Z-indices. For example, 


T zz T ww 

1 Z. 1 w — 



T zz z . 


All that matters is the difference between the number of upper indices and the 
number of lower indices. The tensor T z z is left invariant under z — >• w and is 
regarded as a scalar. The number n is called the helicity. The set of helicity -77 
tensors is denoted by T": 


T = {T 


q 

z—z 


I q- p = «}■ 


(14.16) 


The helicity characterizes the irreducible representation of U(1 ) = SO(2). 



So far we have assumed n is an integer. It can be shown that n — \ 
corresponds to the spinor field on E ? . In fact, the existence of spinors on the 
Riemann surfaces is guaranteed by the triviality of the second Stiefel-Whitney 
class of E ? . The set T 1 is identified with the holomorphic line bundle K over Eg. 
Then T 122 is the square root of K : = K — ‘X 1 where S+ is the positive- 

chirality spin bundle. Similarly, we have T _ 1 — K = Si where S_ is the 
negative-chirality spin bundle. 2 

Example 14.1. In real indices, the helicity ± 1 vectors are given by V 1 ± i V 2 . This 
follows since 

v 'w + y 2 w = (yl + iy2)dz + {yl ~ iy2)9z - 

We put V z — T 1 + iV 2 and V z = V 1 — i V 2 ~ V-. The helicity ±2 tensors are 
7’ 1 1 ± i T 22 , where 7’ is a symmetric traceless tensor of rank two. In fact, we find 



= 2(7’ 1 1 + iT n )d z ® 3- + 2(7’ 11 — i7’ 12 )3= <g> d z . 

Clearly T zz — 2(7’ 1 + iT 12 ) has helicity +2 and T zz = 2(7’ 11 — i7’ 12 ) has 
helicity -2 (note that g z ~ z g zz T zz = T zz ). 

14.1.3 Covariant derivatives 

The only non-vanishing Christoffel symbols of E g are (see (8.69)) 

r z zz = g zz d z gzz = 23 z o- r z rz = g~ zz d zgzz =2d z a. (14.17) 

For tensors in T", we define two kinds of covariant derivative: : T" — > T" +1 

and Vi n) : 7 n ->■ T" _1 . Let 

<7 

T z '" z e (q- p = »)• 

p 

We define 

v 1 T'Z-.Z — pZZy.rrZ-Z 
V (»C z...z-8 Vjt z...z 

= g ZZ [9z + (q~ p)r Z zz\T z ~ z z ... z 
= g zl d-zT z - z z ...z (14.18a) 

y(n)jz-z^ ^ _ y_7 ' z ~ z z z 

- [3z + (q - P)r z zz ]T z - z z ... z 
= (d z + 2 nd-o)T z - z z ... z . (14.18b) 


2 We use S±, instead of A±, to denote the spin bundles. The symbol A 21 is reserved for Laplacians. 



In (14.18b), 2nd z a acts like a gauge potential A. We also define covariant 
derivatives with respect to z, 


Vw = s“ v i" ) . v f ) = «« v („)- ^ 14 - 19 ) 

The curvature two-form of K and the scalar curvature associated with the 
Christoffel symbols are 

T = R z zzz dz A dz = — 3j(23-ff) dz A dz 

= — 23j3=a dz A dz (14.20a) 

n = g lz Ric lz + g zz Ric zz = — 8e _2<T 3 z 3 z er. (14.20b) 

Exercise 14.1. Verify that 

S7 z n) = 2e~ 2a 3 Z V z (n) = e~ 2ncr d z e 2na (14.21a) 

V* n) = 2e“ 2(n+1)or 3,e 2 " <T V ? ( '° = 3 Z . (14.21b) 

V: ;j ( and V z n) are mutual adjoints with respect to a properly defined inner 
product. Let T, U e T". We require that the inner product be invariant under a 
holomorphic change of the coordinate z -> w. Since 

gzz |dw/d z\~ 2 g z ~ z d 2 zV? -> d 2 w^fg 

f -» (dxvjdz) n f U -+ (dw/dz) n U. 

We find the combination 

(T, U) = J d 2 zVg(g Z z) n fU (14.22) 

is invariant under holomorphic coordinate transformations. Take T e T" and 
U e T" +1 . We find that 

(U, y 7 (n] T ) = J d 2 ze 2rr 2- n - l e 2(n+l)a U2e- 2a d- z T 

01 (partial integration) 

= - J d 2 zV«fe)“[vf +1) £/]r = (-vf +1) (/,r). 

This shows that 

C v („ ) ) t = -V(" +1) - (14.23a) 



Exercise 14.2. Show that 

(VfV = -V^_ 1V (14.23b) 

We define two kinds of Laplacian AjK : T" — >■ T' !±1 — >■ T" by 

A+ n) = -V z (n+1) V ( z n) = — 2e _2or [3-3= + 2n(3 z a)3 z ] (14.24a) 

A”, ee -V^ pVW = — 2e _2or [3-3= + 2n(d z a)d- z + 2«(3 z 3 z a)]. (14.24b) 
Then it follows that 

K) ~ A M = 4«e- 2ff (3 z 3 z a) = -\nK. (14.25) 

This shows, in particular, that 

A+ } = A" 0) (= A (0 )). (14.26) 

14.1.4 The Riemann-Roch theorem 

Here we derive a version of the Riemann-Roch theorem from the Atiyah-Singer 
index theorem following D’Hoker and Phong (1988). 

Theorem 14.1. (Riemann-Roch theorem) Let E ? be a Riemann surface of genus 

(ft') 

g. Then the index of the operator V; is 

dim c ker V, (n) - dim c ker = (2 n - l)(g - 1). (14.27) 


(ft) 

Proof. We use the heat kernel to evaluate the index. We first note that ker V, = 
ker AT, and ker V ( ; n _ 1) = ker Ai ^ (see (14.24)). The heat kernel 3C+ of 
satisfies 

+ A+ n) ) K+(Z, W- 1) = + A - l/„) K+(z, W,t) = 0 

where A = — 2d z d z is the flat-space Laplacian and 

V n = A - A+ ;) = (1 - e _2or ) A + 4ne _2<T 3 z er d z . 

The Laplacian A also defines a heat kernel by 

(4 + a) K(z.w, 0 = 0 


which is easily solved to yield 



The perturbative computation and iteration yield 


IC+(z,z'\ t) — K(z, z!\ t) 

+ J d sj dw K(z, w; t — s)V n (w)JC+(w, z'; s) 

— K(z,z\t) + J ds J dw K(z, w; t — s)V n (w)K(w, z; s) 


fds J ds' J dv J 


dw K(z, v;t — s)V n (v) 


x K(v, w; s — s')V n (w)K(w, z!\ s') 

+ ••• . 

We are particularly interested in DC+ (z, z; t), t being small, 

IC^(z,z\t) = - 1- f ds f d w K(z,w; t — s)V„(w)K(w, z\ s) + O(t). 

4?r t Jo J 


(14.28) 


If we take a coordinate system in which a = 0 at z, we have 

a(w) ~ 0 + d z o(xv — z) + 3 - z a{w — z) 

+ ±[3 2 ct(w - z) 2 + d 2 a (w - z) 2 + 2d z d~ z a\w - z| 2 ] + 


Due to rotational symmetry in two-dimensional space, only those terms with 
one z -derivative and one z-derivative survive in the integral in (14.28). Terms 
proportional to d z crd z a cancel between the second and third terms in the 
expansion and we are left with terms proportional to d z d z a. Now we have to 
evaluate 


J ds J drw K(z, w; t — s) 

x [23 ; 3jcr|w — z| 2 A u , + 4 n(w — z)d z d- z adjjj]K(w, z; j). 

From the identities 

J d 2 w K(z, w\ t — s)\w — z| 2 A w K(w, z; s) 

— 7y [ d 2 w\w\ 2 exp ( |w| 2 ^) 

167 r 2 s 2 (t-s)j 11 F V 2s (f — s) ') 

T~7 f d 2 w\w\ 4 exp ( Nl 2 ^ 

32jt 2 s 3 (t — s) J 11 F V 2s(f-s)‘ J 

(i t — s)(2s — t ) 



and 


J d 2 u> K(z, w; t — s)(z — w)dti>K(w, z; s) 

1 f 2 ( t 2 \ f — s 

32n 2 s 2 (t — s) J \ 2s(t — s) J 4tt t 2 

we find that 

/C+(z,Z-,t) = -L + ^-^Aa + 0(t). (14.29a) 

47 T t 1 ATT 

We also have the diagonal part of the heat kernel for A 7, , 


K -t i . K n = ±. + l -2l^ + ou). 


(14.29b) 


From (14.29) and (14.20b), we obtain 


ind V, ( ' !) = 


/ 


d 2 z 


1-3 n 1 + 3(n - 1) 


127T 


12?r 


Act 


1 -2 n 
8jt 


1 


d 2 x n 


2 n - 1 


-X(S ? ) = (2n - l)(g - 1) 


where 


An 


/ 


d-\r U = 2-2g 


is the Euler characteristic of 


14.2 Quantum theory of bosonic strings 

Now we are ready to introduce Polyakov’s formulation of bosonic strings, which 
is based on the path integral over geometries. Since the string action contains an 
enormous symmetry, we have to pay special attention to counting independent 
geometries once and only once. This is achieved by the Faddeev-Popov trick. 
Our argument will be restricted to the simplest case, namely closed orientable 
bosonic strings; the theory is defined on Riemann surfaces. 


14.2.1 Vacuum amplitude of Polyakov strings 

According to the general prescription of the path integral formalism, the partition 
function (vacuum-to-vacuum amplitude) of the string theory is given by 

OO OO r 

z = J2 z g =J2 / VXVye- s L x - y] 

g = o g=0 J 


(14.30) 




0 = 0 g =1 5=2 


Figure 14.1. The total vacuum amplitude is given by summing over g-loop amplitudes. 

see figure 14.1. To avoid confusion, we denote the genus by g and the metric by 
y . The sum over genera amounts to the sum over the topologies. Z g is the g-loop 
amplitude and is obtained by integrating over all metrics y and all embeddings X. 
As we shall see later, the measure VXVy is not well defined and we need some 
modifications. The string action .S' [A, y] is taken to be 

S[X, Y^ = \J d 2 § VYY af> d a X> l dl>X ll + ^ f d 2 f ^fyU. (14.31) 

The first term is the Polyakov action. The second term is proportional to the Euler 
characteristic 

X = d 2 ^n^2-2g 

and serves as the string coupling constant; the amplitude of a loop with genus g is 
suppressed by the factor e~ 2 '". Since this term is a topological invariant, it does 
not affect the dynamics of the string. We are interested in Riemann surfaces of a 
fixed genus g and drop this term. The first term of the action has the following 
symmetries (section 7.11): 

(A) Diff(E ? ), the group of diffeomorphisms / : E ? — »■ £ ? . Let f“ — > f “(f) be 
the coordinate expression for /. The new metric is the pullback of the old 
one whose coordinate component expression is 

* d$ s 

y* - fy* = a^3f?yr»- (14J2) 

The embedding also gets transformed as 

X^ -+ f*X » = X^ f. (14.33) 

The invariance of the classical action takes the form 

S[X,y] = S[f*X,f*y]. (14.34) 

(B) Weyl(Eg), the group of two-dimensional Weyl rescalings 

Yc/ft ^ Yafi — ^ Yotfi 


(14.35) 



(f ’X, e*f*Y) 


< lx M 



Gauge slice 


Figure 14.2. An element of £ x is obtained by the action of Diff(E ? ) * Weyl(E ? ) on 
an element ( X , y) in the gauge slice. 


where 4> e 3TEg). The conformal invariance of S takes the form 

S[X, y] = S[X, y]. (14.36) 

The symmetries (A) and (B) must be preserved under quantization, otherwise 
the theory has anomalies. 

According to the standard Faddeev-Popov formalism, the degrees of 
freedom corresponding to these symmetries have to be omitted when we define 
Z g . For example, the string geometry specified by the pairs (X \ . y\) and 
(X 2 , Y 2 ) should not be counted independently if they are related by an element of 
Diff(E ? ). Similarly, (X, y ) and (X, e^y) should not be counted as independent 
configurations. Unless special attention is paid, we would count the same 
configurations infinitely many times, which leads to disastrous divergences. It 
turns out that the space of all the geometries (X . y) can be separated into 
equivalence classes (the gauge slice), any two points of which cannot be 
connected by these symmetries, see figure 14.2. 

To be more mathematical, let £ be the space of all the embeddings X : 
Eg -> M° and let A4 g be the space of all the metrics defined on Eg. Naively, 
the path integral is defined over £ x M g . Because of the symmetries (A) and 
(B), however, the integral should be restricted to the quotient space (£ x j\4 g )/G 
where G — Diff(Eg) * Weyl(Eg) is the gauge group. 3 The action of (/, e^) on 
(X, y) e £ x M g is 

(/, e+XX, y) = (f*X, e+f*y). (14.37) 

The quotient Mg/G is called the moduli space of Eg and is denoted by 
Mod(£g). We are also interested in the subgroup Diffo(Eg) of Diff(Eg), which 

3 Here * denotes the semi-direct product. Note that Diff(Eg) fl Weyl(Eg) ^ 0. We shall come back 
to this point later. 




Figure 14.3. The mapping class group (MCG) is generated by Dehn twists around a,- , b, 
and Ci (1 < i < g). 


is a connected component of the identity map. The quotient space Teich(E g ) = 
,Vl„/l')iffo(E„) * Weyl (Eg) is called the Teichmiiller space of ST,. The general 
theory of Riemann surfaces shows that Teich( E g ) is a finite-dimensional universal 
covering space of Mod(Eg). Explicitly, we have 


dimjj Teich(Eg) = 


2 


6g - 6 


g = 0 
g = 1 
S>2. 


(14.38) 


The group Diff(E ? )/Diffo(E g ) is known as the modular group (MG) or the 
mapping class group (MCG). The MCG is generated by the Dehn twists defined 
in example 8.2. For the torus with genus g , the MCG is generated by 3g — 1 Dehn 
twists around a,-, £>, and c, in figure 14.3. Unfortunately, these 3g — 1 Dehn twists 
are not the minimal set of the generators. The general form of MCG for g> 2 is 
not well understood. 

From these arguments, the meaningful partition function turns out to be 




L 


VXVy 


3-slx.yl 


£xMe V (Diff * Weyl) 


(14.39) 


where V (Diff * Weyl) is the (infinite) volume of the space of Diff(E ? )*Weyl(E g ) 
and takes care of the infinite overcounting of the same geometry. The order (the 
number of elements) of MCG is denoted by |MCG| . Clearly, 


V (Diff * Weyl) = |MCG| V(Diff 0 * Weyl). (14.40) 


14.2.2 Measures of integration 

We have to define a sensible measure to carry out the integration (14.39) so that 
the physical degrees of freedom and the gauge degrees of freedom are separated. 
This separation of degrees of freedom requires the Jacobian, 


VyVX -» J(V physical ) {V gauge). 


(14.41) 



To find this Jacobian, we note that the Jacobian on a manifold M agrees with that 
on T M. To see this, let x >l (y ,x ) be a coordinate of a chart U (V) of M such that 
U fl V 0. The Jacobian of the coordinate change is J = det (dy fl /dx v ). Take 
V e T p M. In components, we have V = u^d/dx 11 = v^d/dy^, where 


v » = u v (dy^ /dx v ). (14.42) 

{n M } and {i> M } are fibre coordinates of T p M. The Jacobian J associated with this 
coordinate change is 

J = det(3i) M /3n 1 ’) = det(dy> i /dx v ) = J. (14.43) 


This shows that the Jacobian at p e M is the same as that on T p M. The Jacobian 
J depends on p but not on the vector itself, since J depends only on p. 


Example 14.2. Let (.r, y) and (r, 9) be coordinates of K 2 , where x = r cos 6 and 
y — r sin 9. The Jacobian of the coordinate change is 


J — det 


d(x,y) 

3 (r,0) 


= r. 


Let us take 


V = v x d/dx + Vyd/dy = Vrd/dr + v e d/dd e T p R 2 . 
(v x , v y ) and (v r , v$) serve as fibre coordinates of T p R 2 . Since 

v x — v r dx/d r + Vgdx/dO v y — v r dy/dr + Vgdy/dd 
the associated Jacobian J is easily calculated to be 


/ = det[3(u x , v y )/d(v r , i> 0 )] = 


dx/dr dx/80 
dy/dr dy/d9 


= J. 


Let us derive this Jacobian in an indirect but suggestive way. We normalize 
the measure d 2 i> as 4 

1= J d 2 v exp(— i||v|| 2 ) = J di>. v di> v exp[— ^(u 2 + t> 2 )]. 

We also have ||u 2 || 2 = v 2 + r 2 v p . Noting that the Jacobian is independent of v r 
and vg , we have 

1 = J J dry dvg exp[— j(u 2 + r 2 u^)] = J r~ { 


4 


This normalization of the measure differs by a constant factor from the conventional one. 



from which we find J = r. We use this procedure to find the functional measure 
of string theory. 5 

This analysis enables us to write 

VSyVSX — JV8 (physical) V8 (gauge) (14.44) 

where Sy (8X) is a small variation of the metric y (the embedding X) and is 
regarded as an element of T y { Mg) ( TxS ). The meaning of the RHS becomes 
clear in a moment. 

Consider the diffeomorphism generated by an infinitesimal vector field 8 v 
on Eg. Since Sv is infinitesimal, it belongs to Diffo(E g ) rather than the full group 
Diff(E ? ). The changes of the metric and the embedding under 8 v are (see (7.120)) 

SuYap = (Cs v Y)ap — V a 8vp + Vp8v o, = 8v a d a X. (14.45) 

The changes of y and X under an infinitesimal Weyl rescaling e' 1 ' 5 are 

SwYctp = 8<py a p SyjX = 0. (14.46) 

These changes belong to unphysical (gauge) degrees of freedom. In general, a 
small change of metric is given by 

8 Yap = Sw Yap + <5d Yap + (physical change) 

0 

= 8<t>y a p + V a 8vp + Vp8v a + St'—jYapCt ) (14.47) 

or 

where the last term is called the Teichmiiller deformation of the metric, which 
can neither be described by a diffeomorphism nor by a Weyl rescaling. As 
mentioned before, {/} is a finite set, 1 < i < n — dimjj Teich(E g ). It is 
convenient for later purposes to separate Sy into a traceless part and a part with a 
non-zero trace. We write 


Sy a p = 8(py a p + ( P\8v) a p + 8t‘T ia p{t ) 


where Tj u p is the traceless part of the Teichmiiller deformation. 


Tiap = 


dy a p 

~dF 


l 

- TjYaPY 


V 8 d YyS 
df 


The operator Pi is defined by 


(14.48) 


(14.49) 


(P\8v) a p = V a Svp + VpSv a - y a p(V y Svy) (14.50) 

and picks up the traceless part of S£,y a p while 8(p is defined by 


8(p = 8(p + 



Sv Y + trace part of 8t 


dy_ 

dt 


(14.51) 


5 It should be kept in mind that we introduce the tangent space only to obtain the Jacobian. The 
tangent space itself has no physical relevance. 



where we do not need the explicit form in the parentheses. 

As for the embeddings, we consider the quotient £/ Diff(E g ). An arbitrary 
embedding X is obtained by the action of Diff(£ ? ) on some X e £/Diff(E g ). 
Then a small change of the embedding is expressed as 

8X = Sv a d a X + 8X (14.52) 

where the first term represents the change of X generated by 8 v while the second 
is not associated with diffeomorphisms. Now the measure should look like 

V8yV8X = J d n tV8vV8<pV8X . (14.53) 

To define the measure, we need to specify a metric on the tangent space, see 
example 14.2. We restrict ourselves to the so called ultralocal metric which is 
quadratic and depends on y a p but not on dy a p. Define a metric for symmetric 
second-rank tensors by 

W&hfy = f d 2 § Vr(G afiyS + uy 0l Py yS )8h c/ p8hys (14.54a) 

where m > 0 is an arbitrary constant and 

Qcfiys _ yCXYyPS + yCtSyfiy _ y aft yY S _ (14.55) 

It is readily verified that G is the projection operator to the traceless part 
(tr G a ^ rS 8hys = y a pG a P yS 8hys — 0) while uy a Py yS is that to the trace part. 
In a finite-dimensional manifold, a metric defines a natural volume element. In 
the present case, however, the measure cannot be defined explicitly and we have 
to define it implicitly in terms of the Gaussian integral (see example 14.2), 


J V8hexv{-\\\8h\\ 2 y) = 1. (14.56a) 

Similarly, the metrics for a scalar S(p , a vector Sv and a map 8X 1 ' are defined by 
Wfy = J d 2 l; ^y8<!> 2 (14.54b) 

ll<5w|| 2 = J d 2 § y/YYap8v a 8vP (14.54c) 

\\SX\\l = f d ^JySXnXp. (14.54d) 

With these metrics, the measures are defined by 

j V8<j>exp{-\\\m 2 y)= 1 (14.56b) 

J V8v ew(-\\\8v\\ 2 y) = 1 (14.56c) 

J V8X ^{-^_\\8X\\ 2 y ) = 1. ( 14.56d) 



Exercise 14.3. Show that ||<5y ||~ and ||SX \\^ are invariant under Diff(E g ) but not 
under Weyl(E ? ). This is the possible origin of conformal anomalies, see (14.84). 

Before we proceed further, we need to clarify the overlap between Diffo (E g ) 
and Weyl(E g ). Suppose Sv e ker P \ , that is, 

P\8v = V a Svf) + Vp8 v a - y a p(V y 8v r ) = 0. (14.57) 

We find, for such Sv, that S£,y a p = (Vy8v r )y a p. A vector 8 v e ker Pi 
is identified with the conformal Killing vector (CKV), see section 7.7. It is 
important to note that <5 d and <5w yield the same metric deformations if 8<p is taken 
to be V y 8v r . Thus, the set of the CKVs is identified with the overlap between 
Diffo(E ? ) and Weyl(E g ). Let there be k independent CKVs on E g and denote 
these by <t>" (1 < s < k). It is known from the theory of Riemann surfaces that 

6 g = 0 

k= 2 g=l (14.58) 

.0 g>2. 

We separate 8 v into a part generated by the CKV, and its orthogonal complement, 
which we write as 

= ai)" + (14.59) 

The tangent vector SX is also decomposed as 

<$X = 8X + 8v a d a X^ + 8a s <i>* d a X» . (14.60) 

The functional measures now become 

V8yV8X -* J d" 8 tV8<pV8v d k 8 aV8X (14.61) 

where we noted that the t- and a-parameters are finite dimensional. 

Let Diff^-(E ? ) be the subspace of Diffo(E g ), which is orthogonal to the 
CKV. We have 

V(Diffo) = V (Diffjj - ) • V (CKV) (14.62) 

V (Diffo * Weyl) = V(Diff^) V(Weyl) 

= V (Diffo) V (Weyl )/ V (CKV). (14.63) 

Take a slice y(t) of M g . The slice is parametrized by n Teichmiiller 
parameters. Any metric y related to y by G = Diff(E g ) * Weyl(E g ) is written as 

Y = f*(e+Y) f e Diff(E g ), ^ e Weyl(E g ). (14.64) 

We express a small deformation 8y at y as a pullback of a deformation Sy at 
y = e Sr ky: Sy — f*(Sy). Note that Sy is a small diffeomorphism at the origin 



of Diffo(Eg) and, hence, can be described by a vector field Sv. As was shown 
in exercise 14.3, Diff(E g ) is the isometry of the relevant vector spaces. It then 
follows that 


\\8y\\ 2 y = \\f*(Sy)\\ 2 rY = \\Sy\\ 2 Y y = ^y. (14.65) 

At the point y, we decompose Sy as 

8y a p = 8(py a p + {P\8v) a p + 8t'T ia p (14.66) 

where 8<j> has been redefined so that it includes the trace parts of the Teichmiiller 
deformation and V a Svp + V p8v a , see (14.51). 

Exercise 14.4. Show that Tj a p at y is related to Tj a p at y as 

Tiap = ^T la p. (14.67) 

Now we are ready to give the explicit form of the measure. We first find the 
Jacobian associated with the change V8v —> V8v6 k 8a. We have 

1 = / ^ ><5v ex P(— 2ll« 5 ^ll y) 

= J J V8vd k 8aexp(-±\\8v\\l - ±||<5a s <I>* || y ) 

= J[det(<J> s , <b r )r 1/2 (14.68a) 

where 

(<*>„ <*V) = j d 2 Sy/yy aP <l>“*?. (14.68b) 

[Remark: Although the matrix element (14.68b) is defined for y — e^y, we can 
show that it is independent of e^. To see this, let us take a CKV 3>“ of the metric 
y: V a 4 > s p + Vp$> sa — y a p V <t>(" , where V is the covariant derivative with respect 
to y and <I>. sa = y a p <J>f . A simple calculation shows that = y a p^s — e?&sa 
satisfies 

Vq.<E>^ + = e^(V„4> s ^ + Vp& sa + yaP^dyCp) 

= ^y a p(%^ + <t>Uy<P) = YvpVyi py 

V being the covariant derivative with respect to y . Thus, <t>“ = O" is a CKV 
of the metric y — e^y and the CKV are taken to be cp independent.] Equation 
(14.68a) shows that 

V8v = [det(<D,, ct> s )] l/2 V8vd k 8a. (14.69) 

Now the total measure is written as 

/[det(4> r , ch s )] 1/2 d''f V8<pV8v & k 8 aV8X (14.70) 



where J takes care of the rest of the variable changes. 

The Jacobian J is now obtained from (14.60), (14.66), (14.70) and the 
definition of the measures (14.56). We have 


= f V8ymXex P (-±\\8y\\ 2 Y -±\\8X\\ 2 y ) 

= /det 1/2 (4>, <1>) J d' 


8 tV8vV8<p d k 8 aV8X 


x exp 


+ (Pi8v) a p + 8t 


i dy a p 


= J 


- -\\8X+8v a d a X + 8a s <5> a s d a X\\ 
det 1/2 (0, 0) J d"<5 tV8v . . . exp(— ^ 


df 


\\MV\\ Z ) 


where 


/ Sf \ 

8cp 

V = 8 v M = 

8a 

\sx 

The matrix in the exponent of (14.71) is 

A f C f \ / A 0 

0 B f J { C B 

1 * \ f A f A 0 

0 B f B \ ** I 


dy/dt y 

Pi 

0 

o ' 

0 0 

dx 

0 • dx 

l y 


A 

C 


(14.71) 


0 

B 


(14.72) 


M f M = 


A'U + C^C C f B 
B f C B ] B 


(14.73) 


where * and ** are irrelevant. The last expression has been obtained from the 
identity, 

/ A B\_f I B \ / A- BD~ l C 0 \ 

V C D ) ~\ 0 D )\ D~ X C I ) ' 

The Gaussian integrals in (14.71) are readily evaluated to yield 

1 = Jdet 1/2 (0, 0) det _1/2 (M + M) 

= J det 1/2 (0, 0)[det(A f A) det(5 f 5)]“ 1/2 . (14.74) 

To compute det 1/,2 (A' A), we need to evaluate ||Sy || 2 . We have 

\\Sy\\ 2 y = f d 2 §Vy(G^ + My°V') 

x [8<py a p + (P\8v) a p + 8t‘ T ia p][8(pyy S + ( Pi8v) yS +8t j T jY s\ 

= 4n||<50|| 2 + ||A5C|| 2 + SPStHTi, Tj ) + 28f{Pi8v, 7}). 


(14.75) 



In general, p is not orthogonal to P\Sv. To separate 7} into parts orthogonal to 
Pi Sv and parallel to P\ Sv, we need to define the adjoint Pj 1 of Pi . Pi is an elliptic 
operator which takes a vector field into a traceless symmetric tensor field. Thus, 
P|' maps symmetric traceless tensors to vectors. For a symmetric traceless tensor 
Sh, we have 


(P l Sv,Sh) = I d 2 £ VrG^iPiSv^pShys 

= J +V fi Sv a )Sh afi 

= J d 2 l y /ySv a (-2y fi )Sh cl p = (Sv, P/<5/r) 

where the inner product in the last expression is defined by (14.54c). Thus, it 
follows that 

(P}Sh) a = -2 S/PSKp. (14.76) 

Suppose Sh is orthogonal to P]Sv. From the previous discussion, we have 
(Pi<5i>, Sh) — (Sv, Pj' Sh) = 0. Since Sv is arbitrary, Sh must be an element 
of ker Pj 1 , see figure 14.4. Now p may be separated as 

Ti = Vo Ti + Vj_Ti (14.77a) 

where the projection operators Vo and Pj_ are defined by 

Vo = 1 - Pi - p] V ± = Pi P, f . (14.77b) 

P,P| P/Pi 

It is easy to verify that Po + V\ = 1, PoP_l = 0, P' X V o = 0, P^V _l = 
Pj , PoP = 7) and Pj_p = 0 for 7’, e ker Pj' etc. Thus (14.77a) is an orthogonal 
decomposition of 7) . We write Pj p = Pi«/, where 


P/Pi 


P P- 


Let {Vv} (1 < r < n) be a real basis of kerP^, which is not necessarily 
orthonormal. Then p can be expanded as (figure 14.5) 

V = J2^Qn + Pim- (14.78) 

r 

Taking an inner product between Ti and \[r r , we find that 

Qri = ^[(lA, V f ) _1 ]r i (V f i , P)- 

5 


(14.79) 



Finally, Sy is decomposed into mutually orthogonal pieces as 

Sy — 8<py + P\(8v + St'm ) + Sf'V^Gn- (14.80a) 

Correspondingly, the space of the metric deformation {Sy} separates into the 
direct sum 

{Sy} = {conf} © {im P \ } © {ker/^}. 


(14.80b) 




Substituting (14.80a) into (14.75), we obtain 


||«5 K || 2 = 4«||,50|| 2 +||P 1 3i;|| 2 

+ St'St-i (Tj, f); l UW s , Tj)y (14.81) 


where <5u = Sv + 8t l Ui and the inverse in the last term refers to the inverse of the 
matrix (a rs ) = (( 1 / 0 - , 1 /r s )). If we put = ( St , <50, Sv), we find that 


der 1/2 (A f A)= J d" <5 tV8<f>V8v exp(— A^AVi) 

= J V8<p exp(— 2z<| |<50| | 2 ) J TOuexp(-^||Piu|| 2 ) 
x j d"5 1 exp{-^r f (7/, 0, -)[(0, Tj)8t j } 


oc (det pf P\)~ 1 / 2 


/ det (T, 0 ) 2 \ 1/2 
l det(i/r, %[r) I 


(14.82) 


Collecting the results (14.71) and (14.82), we have 
1 = J det 1/2 (<J>, O) det“ 1/2 B f B det“ 1/2 p} Pi 
The g-loop partition function is then given by 



2 , 


/ 


d n t VvV(f> detX 
V (Diff * Weyl) 


det 1/2 B t Bdet“ 1/2 (<b, O) 


X 


^det p\ P x 


det(T, 0) 2 \ ^ 

det(0, 0) J 


(14.83) 


The integral over a (the CKV) has been omitted since it is already included in the 
^-integration. Naively, the integral over v yields V (Diffjj ) and that over 0 yields 
T(Weyl). However, as exercise 14.3 shows, the measures VX and Vy depend 
on the conformal factor. Polyakov (1981) has shown that, under the conformal 
transformation y — > er^y, the measures transform as 


VX — > exp 
Vy -> exp 


(2S2 / d2? Vr^W + n«))vx 

(2S / ^ VYiY^daWpt + K^Vy. 


(14.84a) 

(14.84b) 


Thus, the measure VXVy is conformally invariant if and only if D = 26. This 
number 26 is called the critical dimension. Henceforth, we always assume that 



D — 26. Now (14.83) simplifies as 


Z„ = — 1 — f d n tVX det 1/2 B f B det“ 1/2 (cD, <t>) 
|MCG| J 


/ + det(P, 1 1rY 

x det P, Pi — — — 
l det (\j/, i fr) 


We perform the X -integration to eliminate det 12 B ' B. We have 


(14.85) 


1 = J P><5Xexp(-±||<5Z|| 2 ) 


= V8Xd k Saexp(-^\\8X + | 2 ) 

= J J P>,5Xexp(-^||<$X|| 2 ) J d^5« exp(— i||5«^<l>“a Q ,^|| 2 ) 

= J det -1 / 2 !# 1 /?) 

and hence det l 2 (/i 1 B ) is identified with the Jacobian of the transformation 
X -> (it, a). Thus, it follows that 


VX det 1 / 2 P t Pe -5 = 


V (CKV) 


(14.86) 


where V (CKV) = / d k a is the volume of the CKV. 

The integration over X is readily carried out. Let us write 


where 


J VXe~ s — J VX exp[— :j(V, AZ)] 


a = -—a a Vyy ap c 

Vy 


(14.87a) 


(14.87b) 


is the Laplacian acting on 0-forms, see (7.188). We write down the explicit form 
of the path integral (14.87a). Let i jr n be the eigenfunction of A, 


At/r„ — T ./7 xj/fi X n G [0, oo) 


(14.88) 


where i jr n are normalized as 


L; > tXn ) — J d“f ^[yfnifm — 


The eigenvalue X is non-negative since A is positive definite. Let us expand X 1 ' 
in i//„ as 


= Y. a ni'n = K + X* < e R 


(14.89) 



where X[j — Oq in) is the zero eigenfunction of A and x' 11 are the remaining 
degrees of freedom. Correspondingly, the path integral (14.87a) is written as 

J £>Aexp[-±(A, A A)] = 

= f n dfl o f n n da » ex p ( - \ z ! <a » )2 

J J n^O If ' n,fi 

da^Vdet' A)“ 13 (14.90) 




where the prime indicates that the zero mode is omitted. To integrate over the 
zero mode, we note that the normalized eigenvector xjf o is given by 6 

/ 1 A 1 / 2 


^ V/d 2 ?VF 


(14.91) 


From Aq = Oq i//q, we have 

fn<=/n^r’-‘=v( 7 ^j 


(14.92) 


where V — f J”[ dAp is the spacetime volume. Collecting the results (14.90) and 
(14.92), we find that 


J VXe~ s 


det'A \“ 13 

T&Jvv) 


(14.93) 


where we have dropped V and other irrelevant constants. 

Finally, we have obtained the expression for the g-loop partition function 




/ 


d ”t 


det(7\ i jr) 


Mod F(CKV) det 1 / 2 (i/f, i?) det 1 / 2 (<t>, «J>) 

13 

1/2 i 


, T , ... / det' A \ 

xlM,!KV { 7 ^ 7 ) 


where we have noted that 


1 


|MCG| J Teich 


/ d "' = / ' 


d n t. 


ifg > 2, the Riemann surfaces have no CKV and (14.95) reduces to 

-13 

1 ueu / . 1// 1 4 . , / ueL zx \ 

Z P = 


f „ det (T.\lf) , + wo/ det'A \ LJ 

/ d n t (det Pf Fi) 1/2 ( ... \ . (14.96) 

7 Mod det 1/2 (i/f, \/f) \fd 2 $^/yj 


(14.94) 


(14.95) 


6 Since xJ/q satisfies Ax//$ = 0, it is a harmonic function. Any harmonic function on a Riemann 
surface must be a constant by the maximum principle. 



14.2.3 Complex tensor calculus and string measure 


Since any Riemann surface admits complex structures, we may take advantage of 
this fact to compute string amplitudes. Many beautiful aspects of string theory 
are revealed only when these complex structures are explicitly taken into account. 
Here we rewrite the partition function in the language of complex differential 
geometry. 

We first fix the gauge in M „ by choosing the isothermal coordinate system 

y — [dz <8> dz + dz <g> dz] 

where Yzz. — Yzz = exp 2a. 7 Then the deformation of y under a 
diffeomorphism generated by Sv is (cf (14.45)) 


Yzz — 2V. Sv z 

<5d Yzz = v~ z + V z 8v z = ^ z (V z (I) 5u z + V z _ 1)( 5t>,). 


(14.97) 


Similarly, <$wK generated by an infinitesimal conformal change is (cf (14.46)) 


&wYzz — &(pYzz $WYzz — 0. (14.98) 

To see the action of the operator Pi on vectors, we take 8 v z e ‘X 1 and 
8v z £ T -1 . From (14.50), we find that 

(Pi8v) zz = 2V z m 8v z e 7 2 (14.99a) 

{P\8v) zz = 2V_ ( “ 1) <5u, e 7~ 2 . (14.99b) 

This shows that Pi is a map: 

^( V r <.4,00, 

Similarly, Pj maps traceless symmetric tensors to vectors. For 8h zz € ‘J 2 and 
8h zz e T -2 , we have 

(Pjsh f = V (2) Sh zz e T 1 (14.101a) 

(Pj8h) z = W z _ 2) 8h zz e 7~\ (14.101b) 

Thus, P|' is a map: 

P, f = ^ Vz } yZ j : T 2 © T“ 2 -> T 1 © T _1 . (14.102) 


7 In fact, the gauge is not uniquely fixed with this choice. We will invoke the uniformization theorem 
later to fix the gauge completely. 



The product P^ Pi is 

t / V. (2) V;, , 0 \ , , , , 

p; Pi = I z (1) , ( _ 1} iT'erU'j'er 1 . (14.103) 

V 0 V (-2) V z / 

Accordingly, the determinant in (14.96) becomes 

(det' P/Pi) 1/2 = (det' V Z (2) V ( Z 1} det' V ( Z _ 2) V Z (_1) ) 1/2 

= (det'A+ ) A-_ 1) ) 1 / 2 (14.104) 

where AjK are the Laplacians. We show that the spectrum of A^ } is the same as 
that of A^jj. Take an eigenfunction Sv z of A^, , 

A+ } <5u z = -2e“ 4o '3,e 2cr 9^u z = XSz z (14.105) 

where (14.21a) has been used. The eigenvalue A. is a non-negative real number 
(note A^ , are positive-definite Hermitian operators). Then we find 

A(-l)(Yzz$v z ) = - e~ 2cr d z e 2a d z Sv z = -e~ 2cr d z e 2a d~ z Sv z 

= ~ Yzz 2e _4<T d z e 2cr d z Sv z = \y zz M (14.106) 

which shows that Yzz& vZ i s an eigenfunction of A^jj with the same eigenvalue 
A. It is easy to see that the converse is also true, see exercise 14.5. Thus, A^ 
and A7 j) share the same eigenvalues and det' A^j } = det' A^jj. Now (14.104) 
becomes 

(det' P/Pi) 1/2 = det' A“_ 1} = det' A+ } . (14.107) 

Exercise 14.5. Let Sv z be an eigenvector of A^^ with an eigenvalue A. Show 
that y zz Sv z is an eigenvector of A^ } with the same eigenvalue. 

The physical change of the metric is the Teichmiiller deformation Sr' fit, 
where r' (//,) is the complex counterpart of t l (7)). From our experience, we 
know that the relevant part of the Teichmiiller deformation is symmetric and 
traceless in the real basis. In the complex basis, this amounts to /ij-y = nr zz = 0. 
Accordingly, the general variation of the metric is given by 

SYzz = vt l) Sv z + St‘ih zz (14.108a) 

$Yzz = WYzz (14.108b) 

where we have redefined Sep so that it includes the variation of Sy zz due to Sv 
(note that S^Yzz oc y zz ). In (14.108a), Sv does not contain the CKV, that is, 
Sv e (kerVf -0 )- 1 . 



To carry out the orthogonal decomposition of {<5y}, we need to define the 
inner products in various spaces. The most natural choices are 


l|S/zzl| 2 = 

j d 2 z^/y8y zz 8y zz 

(14.109a) 

\\Sy zz \\ 2 = 

J d 2 z sT/8y z i8y zz 

(14.109b) 

P^|| 2 = J 

d 2 z *Jyy z i8v z 8v z . 

(14.109c) 


Note that 8y zz dz (g> dz and <5jAjdz <g> dz are different tensors; we have to specify 
the inner product separately. 

Following the argument in the previous subsection, we introduce the 
orthogonal decomposition, 

S/zz = V<~ 1} Sv z + S^Hizz = Vt 1)8 Vz + ^ i( t>izz (14.110) 

where 8v = Sv + (projection of Sr 1 Hi zz into jim V; 1 ' } ) . The orthogonality of 
vS — 1) 5i5 z and <pi zz implies 

0 = (V^ 1) Sv z ,</> izz ) = J d 2 Z^/y8v z (-V{_ 2) (pi zz ) 

where we have noted that V~ = —V^_ 2) . Thus, we find that (figure 14.6) 

0 izz e kerV z _ 2) . (14.111) 

The explicit form of V ( '_ 2) shows that di<pi zz = 0, that is ker V^_ 2 , is the set 
of holomorphic tensors of helicity —2. The tensor </>, = (f>i zz . dz ® dz is called the 
quadratic differential while /z, = //,■-- dz ® dz is the Beltrami differential, see 
figure 14.7. In practical computations, it is often convenient to specify the gauge 
slice by the Beltrami differential, see later. Now we have established that 


{kerPj 1 } = {Quadratic differential} = {kerV^_ 2 j}. (14.112) 

The Riemann-Roch theorem (14.27) takes the form 

dimckerV!^ 1 ’ — dimckerV^_ 2) = 3 — 3 g. (14.113) 

Now we have separated {8y } into mutually orthogonal pieces 

{Sy} = {conf} © {im Vj -1) } © {ker V}’_ 2) } + cc (14.114) 

which should be compared with (14.80b). The measure becomes 

VSyVSX -* J d n 8rV8vV8(t>V8X d k 8a (14.115) 


C7-“ 


v7-2 


V<f'> 


(kerV<r ,) ) 1 

, Vf_ 2) 

(ker V f_ 2 )) x 




CKV = ker V*/ 11 

m 

{# = ker Vf_ 2 , 


Figure 14.6. The map V~ * 


and its adjoint V f _ 2 ^ 



Figure 14.7. The Beltrami differential {>u/ } spans the deformation tangent to the gauge 
slice while {<pj } spans ker V^_ 2) . 


where n and k are the complex dimensions of the Teichmiiller space and the CKV, 
respectively. The Jacobian is obtained by repeating the argument in the previous 
subsection and we find that 


Z,= 


-/ 


VyVX- 


1 


-s 


V (Diff*Weyl) 

f det' A+ 

/ d" r VX — — 

Jm od V (CKV) det(0, <p) det(<J>, <t>) 


det(/r, 4>)\ 2 


-s 


(14.116) 


Since we are integrating over complex variables, the power of a half in (14.96) 




does not appear in (14.1 16). The 2T -integration yields 


z 8 = 


-/ 


d"r 


I det(/r, <p)Y 


Mod V (CKV) det(0, 4>) det(<t>, O) 


x det A, 


det' A 


(1) \fd 2 zVy 


-13 


(14.117) 


14.2.4 Moduli spaces of Riemann surfaces 

The spaces Mod (Eg) and Teich(Eg) have been defined as 

Mod(Eg) = M ? /Diff(E g ) Teich(S g ) = _Mg/Diff 0 (E g ). 

They are related through MCG = Diff(Eg)/Diff 0 (Eg) as Mod(E g ) = 
Teich(E g )/MCG. We look at these objects more closely here. We first note: 


g 

dim c CKV 

CKV 

dime Teich(Eg) 

MCG 

0 

3 

SL(2, C) 

0 

SL(2, R) 

1 

1 

U(l) x U(l) 

1 

SL(2, Z) 

>2 

0 

empty 

3g-3 

? 


[Remark'. MCG for g > 2 can be expressed by 3g — 1 Dehn twists which are, 
however, not minimal.] From (14.118), we immediately conclude that Zq — 0 
since the Teichmiiller space is a single point and the volume of SL(2, C) is infinite. 
Of course, this does not imply that the three amplitudes with vertex operators 
vanish. In general, Mod(E g ) is topologically non-trivial although Teich(Eg) is. 
Teich(Eg) is a universal covering space of Mod(E g ) and the topological non- 
triviality comes from MCG. 

In actual computations, the uniformization theorem is very useful. In the 
previous subsection, we first chose the Beltrami differential //, , then changed the 
basis to (pi e ker . Our initial choice /i l is motivated by the uniformization 
theorem. 

Theorem 14.2. (Uniformization theorem) Let Eg be a torus with genus g. Then 
it is conformally related to the constant-curvature Riemann surface, which is given 
by the following: 


g 

Riemann surface 

Metric 

sign7^ 

0 

C U {oo} 

d.v 2 = ds <g> dz/ ( 1 + zz) 2 

+ 

1 

C/L 

ds 2 = ds <g> d5 

0 

>2 

H/G 

d^ 2 = ds ® d5/(Imz) 2 

- 


where L is a lattice in C (see example 8.2), H the upper half-plane and G C 
SL(2, IR) is called the Fuchsian group. The metric for g > 2 is the Poincare 
metric, see example 7.6. 



The proof of this theorem is found in Farkas and Kra (1980), for example. 
Thanks to this theorem, we may always take constant-curvature metrics to form 
the gauge slice in M g . This corresponds to a special choice of the Beltrami 
differential /x, . This slice defines the Weil-Petersson measure: 

f d"r _ f ^(Weil-Petersson) (14.120) 

J det(<j>,<j>) J 

see D’Hoker and Phong (1986). 

Exercise 14.6. Compute the scalar curvature of the metrics given in (14.119). 
Verify that they are independent of z, and z. 


14.3 One-loop amplitudes 

As an illustration of the formalism developed in the previous section, we compute 
the one-loop vacuum-to- vacuum amplitude of the closed orientable bosonic string 
theory. Since dim-TeichfE i ) = 1 and dime ker V* ^ = 1, we have 

z = f dr |(/x, 0)| 2 dct , A + / det' A 
’ ~ Juod V(CKV)(0,0)-(cF,ct>) 6 m \fd 2 ^Vy 

To evaluate (14.121) we need to take several steps. 

14.3.1 Moduli spaces, CKV, Beltrami and quadratic differentials 

In example 8.2, we have shown that the complex structure, namely the conformal 
structure, of the torus is specified by a complex parameter r (Imr > 0). Figure 

8.3 shows the moduli space 

Mod(Eg) = Mi/G = Teich(£g)/SL(2, Z) = H/ SL(2, Z) 
where H is the upper half-plane. 

Take the torus T z specified by the Teichmiiller parameter r = x\ + it 2 (tj > 
0). As a representative, we take a torus in figure 14.8. The metric in C naturally 
induces a flat metric (as guaranteed by the uniformization theorem) 

y = \[dz <8> d5 + d; <g> dz]. (14.122) 

The CKV are globally defined holomorphic vectors. We take <J> = a3/3z as 
the normalized basis of the CKV. The condition (<t>, <J>) = 1 yields j d 2 z\a\ 2 — 
r 2 1 cc | 2 = 1, that is a — l,/2 (we have dropped the phase). The vector T> 

generates translations in the complex plane, 

z z' = z + r^’ 1/,2 (u 1 + in 2 ). 
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(14.121) 


(14.123) 



Figure 14.8. The parallelogram whose complex structure is parametrized by r. 

We must note, however, that the translation is defined modulo the lattice; 
r 2 1/2 (v 1 + it> 2 ) and t 2 l ^ 2 (v 1 + it> 2 ) + (m + r n) yield the identical translation. 
This forces r 2 (tq + itr) to lie within the parallelogram of figure 14.8. Since 

t 2 = J d 2 z = r^ 1 J d 2 v 

V (CKV) is found to be 

V (CKV) = J d 2 v = x\. (14.124) 

Our next task is to evaluate the Weil-Petersson measure. On the torus there 
is one quadratic differential <p . Since (f> e T -2 is a globally defined holomorphic 
differential, it must be of the form, 

4> — adz®dz a e C. (14.125) 

To find the Beltrami differential, we evaluate the change of the metric under 
a small variation of r. For this purpose, it is convenient to introduce the 
coordinate system in figure 14.8. The point A corresponds to (1, 0) and B to 
(0, 1). Accordingly, we have z — 1 1 + t£ 2 . Under a small change Sr of the 
Teichmiiller parameter, we have, up to a conformal factor, 

|dz| 2 -* Id^ 1 + (r + 5r)df 2 | 2 = \dz + Srdf\ 2 

, , dz - dz 2 , id z 

— dz + dr— = dz + Sr- — . 

2tr 2 2r 2 

Comparing this with (14.1 10), we hnd that 


Mzz = i/2r 2 . 


(14.126) 



Here (Sr)/x is the complex conjugate of in (14.110). Of course, this is a 
reparametrization of the Teichmiiller space and does not affect the results. If the 
reader feels awkward with this, s/he may choose f as the Teichmiiller parameter. 
From (14.125) and (14.126), we have, up to irrelevant constants, 

O, </>) = J d 2 z fi zz <p zz = j^ ar 2 oc a 

(<P, (/>) = J d 2 Z<j) zz <p zz = a 2 r 2 . 


Finally, we have obtained 


IO<0)I 2 -i 

= To . 

(<P,4>) 


(14.127) 


14.3.2 The evaluation of determinants 


We first consider det' P[ — det' . Since we take a flat metric, the Laplacian 
takes quite a simple form, 


A+ } = — 23 ; 3j = A 

where A is the Laplacian defined by (14.87b). Since 

I d 2 ^Vy = J d 2 Z = T2 

the amplitude (14.121) reduces to 


C dr 

det' A / 

det' A ^ 

/ — 2 

/Mod r 2 

r 2 V 

, r 2 J 

t 

t 

t 


V (CKV) W-P / d 2 z 


(14.128) 


(14.129) 


where we have used (14.124) and (14.127). We have factorized the integrand so 
that the modular invariance is manifest, see exercise 14.7. 

Let us compute the spectrum of A. It is convenient to express the Laplacian 
in -coordinates. From 

k 1 = i(fz - rz)/2r 2 = (z - z)/2ir 2 (14.130) 

we readily find that 

A = -_L[|r| 2 (3i) 2 - 2n 3,3 2 + (3 2 ) 2 ] (14.131) 

2r^ 

where 3j = 3/3^ 1 etc. The eigenfunction satisfying the periodic boundary 
condition on the torus is 


= exp[27ri(«f 1 + m£ 2 )] (m, n) e 7L 2 . 


(14.132) 



(14.133) 


Substituting this into (14.131), we find the eigenvalue 

2: r 2 

Ki.n = — ~-(m - r n)(m - r n). 
t 2 

The determinant is expressed as an infinite product: 


det' A = Y\ 


'2n 2 


T 2 

m.n Z 


(14.134) 


the product being taken for all integers (m. n ) ^ (0, 0). 

Clearly det' A is ill defined and needs to be regularized. Let us introduce the 
Eisenstein series (Siegel 1980, Lang 1987) defined by 


E( r, s) = ^2 

m,n 


\2s 


( 14 . 135 ) 


the summation being taken for all integers (m. n ) ^ (0, 0). This series converges 
for Re 5 > 1 and can be analytically continued to the complex 5-plane. The series 
E( r, ,v) has a simple pole at s = 1 where we have a Laurent expansion, 

E( r, s) = — — + 2jt[y - In 2 - ln(V^|/;(r)| 2 )] + 0(s - 1). (14.136) 

5—1 

This expression is known as the Kronecker first limit formula and is essential 
for our purposes. In (14.136), y — 0.57721 ... is Euler’s constant and rj{ r) is the 

Dedekind 17 -function 


;/(r) = e i?rr / 12 ]~ [(1 — e 2i7r,nr ). 


n> 1 


Neglecting constant factors, we have 
det' A 


T2 


/ 1 V^'l \ m + rn \ 

= exp I - In T 2 + / _ In ^ — 


0 

= exp ( - In r 2 - — [r|£(T, 5 )] 


j=0 


= exp{— In ri[\ + E( r, 0)] — E'( r, 0)}. 
To evaluate the exponent, we note the functional equation, 
?r“ s r(5)£(r, 5 ) = 7r -n-5 ) r(i _ s)£(Ti I _ 


(14.137) 


(14.138) 

(14.139) 


Taking the limit 5 -> 0 in (14.139), we have 

c/ 1 ^ t-2i + s ) x 

sE(r,l—s) = jt E(r,s) 

T(1 -5) 

(1 — ys + • • • ) , 

= ?r(l - 25 In it + • • • )- -[£( r, 0) + E'( r, 0)5 + • • • ] 

(i + ys h — ) 

= ttE(t, 0) + [— 2(lnjr + y)E( r, 0) + E\ r, 0)]7T5 + • • • . 



From (14.136), we also have 


sE( r, 1 — s) = — 7t + 2jrs[y — ln2 — \n(^/x 2 \r](x)\ 2 )] H . 

Equating the coefficients of s° and s 1 , we find that 

E( r,0) = -l (14.140a) 

E'( r, 0) = -2[ln27r + ln(V^|ij(r)| 2 )]. (14.140b) 

Substituting (14.140) into (14.138), we obtain 

= exp [— £'(t, 0)] = r 2 |^(r)| 4 . (14.141) 

T2 

Finally, it follows from (14.129) and (14.141) that 

Zi = I |/?(t)|- 48 . (14.142) 

a Mod Xj 

A neat form of Z\ is obtained if we define the discriminant 

A(r) = (27r) 12 ?7(r) 24 . (14.143) 

Up to an irrelevant constant, the one-loop amplitude is 

z i = [ ^r 2 - 12 |A(r)|- 2 . (14.144) 

J Mod ^2 

A(r) is known as the cusp form of weight 12, implying 

( ax + b\ 

= (cr + d) u A(r) (14.145) 

cr + a ) 

and c(0) = 0, where the c(n) are the Fourier coefficients, 

A(r) = J]c(«)e 27r " iT . (14.146) 

n> 0 

Fligher genus amplitudes are given by the cusp forms of other weights, see Belavin 
and Knizhnik (1986), Moore (1986), Gilbert (1986) and Morozov (1987). 

Exercise 14. 7. Show that 

rj(x + 1) = e Rl l l2 rj{x) ?;(— 1/r) = (— i x) l ^ 2 ri{x) (14.147) 

where the branch is chosen so that > 0 if z > 0. Use this result to show 
that dr/r 2 and 12 1 ?7 (r) | — 48 are independently invariant under r — »■ x + 1 and 
r — 1 /t. 
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